Wireless device communications system

ABSTRACT

A device includes circuitry configured to determine feedforward and feedback coefficients for an adaptive frequency-domain decision feedback equalizer (AFD-DFE) based on previously received signals. The equalizer output is determined by applying the feedforward and feedback coefficients of the AFD-DFE to a received signal, and the feedforward and feedback coefficients of the AFD-DFE are updated based on the equalizer output.

CROSS REFERENCE TO RELATED APPLICATIONS

The present application is a Continuation of Ser. No. 14/883,348, allowed, having a filing date of Oct. 14, 2015 and claims the benefit of the filing date of U.S. Provisional Application 62/167,656 filed on May 28, 2015, the entire contents of which are incorporated herein by reference.

TECHNICAL FIELD

The present disclosure relates to equalization of Single-Carrier Frequency Division Multiple Access) SC-FDMA signals via adaptive frequency-domain decision feedback equalizers (AFD-DFEs).

DESCRIPTION OF RELATED ART

The “background” description provided herein is for the purpose of generally presenting the context of the disclosure. Work of the presently named inventors, to the extent it is described in this background section, as well as aspects of the description which may not otherwise qualify as prior art at the time of filing, are neither expressly or impliedly admitted as prior art against the present invention.

Orthogonal Frequency Domain Multiple Access (OFDMA) is a multiple access technique which has been adopted in many standards like ETSI DVB-RCT as described in European Telecommunications Standards Institute (ETSI), “Digital video broadcating (DVB); Interaction channel for Digital Terrestrial Television (RCT) Incorporating Multiple Access OFDMA,” ETSI EN301 958, vol. 1.1.1, 2002, D. Prendergast, B. Caron, and Y. Wu, “The Implementation of a Return Channel for ATSC-DTV,” IEEE Transactions on Broadcsting, vol. 53, pp. 521-529, June 2007, WiFi as described in IEEE Std 802.11-2007, “IEEE Standard for Information technology-Local and metropolitan area networks-Part 11: Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specifications,” 2007, and WiMAX a described in IEEE Std 802.16-2009, “IEEE Standard for Local and metropolitan area networks-Part 16: Air Interface for Fixed and Mobile Broadband Wireless Access Systems,” 2009.

Single-Carrier Frequency Division Multiple Access (SC-FDMA) is a multiple access technique which has been adopted in wireless broadband communication systems such as the Long Term Evolution (LTE) standard described in 3GPP, “Physical Channels and Modulation,” ETSI TS 136.211 V10.0.0, 2011. SC-FDMA has comparable complexity and performance to that of the Orthogonal Frequency Division Multiple Access (OFDMA) as described in Q. Wang, C. Yuan, J. Zhang, and Y. Li, “Frequency Domain Soft-Decision Feedback Equalization for SC-FDMA with Insufficient Cyclic Prefix,” International Journal of Computer Science (IJCSI), vol. 9, no. 6, pp. 103-108, 2012, but with an additional benefit of having a low Peak Average Power Ratio (PAPR), which helps in reducing power consumption and increasing battery life in mobile terminals.

SC-FDMA is selected as a standard technique for the uplink LTE. The sensitivity analysis of SC-FDMA is reported in K. Raghunath and A. Chockalingam, “SC-FDMA Versus OFDMA: Sensitivity to Large Carrier Frequency and Timing Offsets on the Uplink,” IEEE Global Telecommunications Conference (GLOBECOM), pp. 1-6, November 2009, and it can be shown that for large Carrier Frequency Offset (CFO) the performance of SC-FDMA can get worse than OFDMA.

The use of the Frequency-Domain Linear Equalizer (FD-LE) in SC-FDMA may not be effective for frequency-selective fading channels with spectral nulls which result in significant noise enhancement. The Frequency-Domain Decision Feedback Equalizer (FD-DFE) may be a more attractive solution when dealing with these kind of scenarios.

SUMMARY

In an exemplary embodiment, a device includes circuitry configured to determine feedforward and feedback coefficients for an adaptive frequency-domain decision feedback equalizer (AFD-DFE) based on previously received signals. The equalizer output is determined by applying the feedforward and feedback coefficients of the AFD-DFE to a received signal, and the feedforward and feedback coefficients of the AFD-DFE are updated based on the equalizer output.

In another exemplary embodiment, a method includes determining, via circuitry, feedforward and feedback coefficients for an adaptive frequency-domain decision feedback equalizer (AFD-DFE) based on previously received signals, determining, via the circuitry, an equalizer output by applying the feedforward and feedback coefficients of the AFD-DFE to a received signal, and updating, via the circuitry, the feedforward and feedback coefficients of the AFD-DFE based on the equalizer output.

In another exemplary embodiment, a system includes a transmitter configured to transmit a signal through a channel; a receiver configured to receive the signal; and circuitry configured to determine feedforward and feedback coefficients for an adaptive frequency-domain decision feedback equalizer (AFD-DFE) based on previously received signals, determine an equalizer output by applying the feedforward and feedback coefficients of the AFD-DFE to a received signal, and update the feedforward and feedback coefficients of the AFD-DFE based on the equalizer output.

The foregoing general description of the illustrative embodiments and the following detailed description thereof are merely exemplary aspects of the teachings of this disclosure, and are not restrictive.

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete appreciation of this disclosure and many of the attendant advantages thereof will be readily obtained as the same becomes better understood by reference to the following detailed description when considered in connection with the accompanying drawings, wherein:

FIG. 1A is a schematic diagram of a communication system, according to certain embodiments;

FIG. 1B is a schematic diagram that shows a transmission from a wireless device to a base station, according to certain embodiments;

FIG. 1C is an exemplary block diagram of a transceiver of a SC-FDMA system, according to certain embodiments;

FIG. 2 is an exemplary block diagram of an adaptive frequency-domain decision feedback equalizer, according to certain embodiments;

FIG. 3 is an exemplary flowchart of an adaptation algorithm for the AFD-DFE, according to certain embodiments;

FIG. 4 is an exemplary illustration of the structure of the channel matrix, according to certain embodiments;

FIG. 5 is an exemplary illustration of a transceiver for SFBC SC-FDMA system, according to certain embodiments;

FIG. 6 is an exemplary block diagram of an AFD-DFE for two-transmit one-receive antenna, according to certain embodiments;

FIG. 7 is an exemplary block diagram of a transceiver for a hybrid SFBC and SM SC-FDMA system, according to certain embodiments;

FIG. 8 is an exemplary block diagram of a receiver, according to certain embodiments;

FIG. 9 is an exemplary graph that illustrates computation complexity for an AFD-DFE, according to certain embodiments;

FIG. 10 is an exemplary graph illustrating performance of AFD-DFE algorithms, according to certain embodiments;

FIG. 11 is an exemplary graph of simulation results for a CRLS AFD-DFE, according to certain embodiments;

FIG. 12 is an exemplary graph illustrating performance of AFD-DFEs, according to certain embodiments;

FIG. 13 is an exemplary graph illustrating an effect of Doppler on the AFD-DFE, according to certain embodiments;

FIG. 14 is an exemplary graph illustrating an effect of Doppler on the AFD-DFE, according to certain embodiments;

FIG. 15 is an exemplary graph illustrating performance of an AFD-DFE in the presence of CFO, according to certain embodiments;

FIG. 16 is an exemplary graph illustrating embedded-SFBC and conventional-SFBC performance results, according to certain embodiments;

FIG. 17 is an exemplary graph illustrating embedded-SFBC performance results in the presence of CFO, according to certain embodiments;

FIG. 18 is an exemplary graph illustrating performance based on percentage of training signals per SC-DFMA block, according to certain embodiments;

FIG. 19 is an exemplary graph illustrating performance of AFD-DFE algorithms, according to certain embodiments;

FIG. 20 is an exemplary graph illustrating performance of AFD-DFEs, according to certain embodiments;

FIG. 21 is an exemplary graph illustrating performance of localized and interleaved AFD-DFEs in the presence of CFO, according to certain embodiments;

FIG. 22 is an exemplary graph illustrating performance of an AFD-DFE in the presence of CFO, according to certain embodiments;

FIG. 23 is an exemplary graph illustrating an effect of velocity of AFD-DFE performance, according to certain embodiments;

FIG. 24 is an exemplary graph illustrating an effect of velocity of AFD-DFE performance, according to certain embodiments;

FIG. 25 is an exemplary graph illustrating an effect of interleaving on AFD-DFE performance, according to certain embodiments; and

FIG. 26 is an exemplary graph illustrating an effect of velocity on AFD-DFE performance, according to certain embodiments.

DETAILED DESCRIPTION

In the drawings, like reference numerals designate identical or corresponding parts throughout the several views. Further, as used herein, the words “a,” “an” and the like generally carry a meaning of “one or more,” unless stated otherwise. The drawings are generally drawn to scale unless specified otherwise or illustrating schematic structures or flowcharts.

Furthermore, the terms “approximately,” “approximate,” “about,” and similar terms generally refer to ranges that include the identified value within a margin of 20%, 10%, or preferably 5%, and any values therebetween.

Aspects of this disclosure are directed to an Adaptive Frequency Domain Decision Feedback Equalization (AFD-DFE) designed for a SC-FDMA system. In certain embodiments, designing an adaptive hybrid DFE can be computationally prohibitive due to involvement of both time and frequency-domain signals. Therefore, the adaptive DFE algorithm described herein employs both frequency-domain feedforward and feedback filters. The block RLS algorithm, as described in A. H. Sayed, Fundamental of Adaptive Filtering. New york: Wiley, 2003, the entire contents of which is herein incorporated by reference, is used to update both the feedforward and feedback filters due to tracking/convergence properties but can be computationally complex due to the matrix inversion operations. Through the embodiments described herein, the structure of the matrices used in performing the block RLS algorithm can reduce the computational complexity. In addition, the design of the AFD-DFE described herein can be extended to SFBC SC-FDMA, Spatially-Multiplexed (SM) SC-FDMA and hybrid SM-SFBC SC-FDMA systems. The AFD-DFE in SC-FDMA can be more complicated in some implementations when compared to an AFD linear equalizer (AFD-LE) due to iterative procedures. However, SC-FDMA transmissions may be confined to the uplink, where processing is done at a Base Station (BS), which may have fewer power limitations than user terminals. In addition, the AFD-DFE implementations described herein may not depend on the subcarrier mapping techniques in SC-FDMA. In addition, the AFD-DFE described herein can be more computationally efficient than a channel-estimate-based DFE, as described in C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, the contents of which are incorporated herein by reference. It can also be demonstrated that the AFD-DFE has improved performance compared to the LE and LMS-based approaches because the AFD-DFE is robust to Inter-carrier Interference (ICI) caused by high Doppler and CFO.

In SC-FDMA, equalization is performed after transforming the received signal from the time-domain to the frequency-domain using the Discrete Fourier Transform (DFT). A linear equalizer (LE) can be adopted in SC-FDMA but for severe frequency-selective fading channels, in which spectral nulls are present, the LE may not be effective as the noise in these spectral nulls can be amplified which can cause performance degradation. Therefore, a DFE may be a more attractive solution.

To improve reliability at the user terminal, a transmit diversity is employed in LTE-Advanced (LTE-A), as described in “Technical specification group radio access Network,” LTE-A transmit diversity schemes for PUCCH format 1/1a/b, RI-092340, 2009, the contents of which are incorporated by reference in their entirety. In addition, other diversity techniques can combat multi-path fading. For example, a transmit diversity scheme with two antennas was proposed by Alamout, as described by S. Alamouti, “A simple transmit diversity technique for wireless communications” IEEE Journal on Selected Areas in Communications, vol. 16, no. 8, pp. 1451-1458, 1998, the contents of which are incorporated by reference in their entirety. Although the overall throughput is not increased, the Alamouti technique has features which can be implemented. For example, the Alamouti technique is an open loop technique, which means that Channel State Information (CSI) may not be required at the transmitter, encoding and decoding complexity is reduced due to linear processing at the terminals, and overall complexity is reduced. Furthermore, the Alamouti technique achieves full spatial diversity gain at rate 1 (since two symbol are transmitted in two time slob) and failure of one antenna chain does not result in data loss, so the Alamouti technique is compatible with single antenna systems.

The scheme proposed by Alamouti is a special case of Space Time Block Codes (STBC), as described in V. Tarokh, H. Jafarkhani, and A. Calderbank, “Space-time block codes from orthogonal designs,” IEEE Transactions on Information Theory, vol. 45, pp. 1456-1467, July 1999, the contents of which are incorporated by reference in their entirety. Alamouti's STBC can be applied to the 3GPP LTE uplink over two SC-FDMA symbols and two transmit antennas. Due to single carrier transmission from both antennas, the low PAPR property of SC-FDMA may not be affected by signs and/or the complex-conjugate operation. Unfortunately, STBC cannot be used in SC-FDMA because for STBC, the encoding is done in pairs, therefore, the frames in SC-FDMA include an even number of SC-FDMA symbols, which may not be guaranteed in LTE systems (referred to as the orphan symbol problem). In some examples, the frames contain an odd number of SC-FDMA symbols. Moreover, in STBC it can be assumed that the channel remains constant for two SC-FDMA symbols, which may not be valid in cases of fast varying channels, which can result in performance degradation.

An alternative scheme to STBC is Space-Frequency Block Codes (SFBC), as described in K. Lee and D. Williams, “A space-frequency transmitter diversity technique for OFDM systems,” IEEE. Global Telecommunications Conference (Globecom), vol. 3, pp. 1473-1477, 2000, the contents of which are incorporated by reference in their entirety. SFBC is applied within each A-point DFT block. Therefore, SFBC may not suffer from the orphan symbol problem faced by STBC when used in SC-FDMA and the number of symbols in each frame may not be restricted to an even number. As SFBC is applied to each SC-FDMA symbol, the size of the block is even which may be guaranteed in LTE systems. However, one drawback of SFBC is that the low PAPR property is affected due to frequency inversions between successive subcarrier as described in C. Ciochina, D. Castelain, D. Mottier, and H. Sari, “A Novel Space-Frequency Coding Scheme for Single Carrier Modulations,” IEEE 18th International Symposium on Personal, Indoor and Mobile Radio Communications, pp. 1-5, September 2007, the contents of which are incorporated by reference in their entirety. Therefore, a scheme is proposed in Alcatel Shanghai Bell Alcatel-Lucent, “STBC-II scheme with non-paired symbols for LTE-Advanced uplink transmit diversity,” in R1-090058, 3GPP TSG RAN WG 1 Meeting #55 bis, 2008, the contents of which are incorporated by reference in their entirety, in order to tackle the orphan symbol problem. For example, spatial repetition is used on odd symbols and STBC is applied on even symbols, which cannot achieve full diversity, and performance degrades at high Signal-to-Noise Ratios (SNR). In W. Y. Lim and Z. Lei, “Space-time block code design for single-carrier frequency division multiple access,” IEEE 20^(th) International Symposium on Personal, Indoor and Mobile Radio Communications, pp. 516-520, September 2009, the contents of which are incorporated by reference in their entirety, another approach is presented to avoid the orphan symbol problem by applying the STBC before the M-point DFT. However, in some channels performance may not be satisfactory, especially at high SNR. In certain embodiments described herein, conventional SFBC is used.

In D. Falconer, S. Ariyavisitakul, A. Benyamin-Seeyar, and B. Eidson, “Frequency domain equalization for single-carrier broadband wireless systems,” IEEE Communications Magazine, vol. 40, pp. 58-66, April 2002, the contents of which are incorporated by reference in their entirety, N. Benvenuto and S. Tomasin, “On the comparison between OFDM and single carrier modulation with a DFE using a frequency-domain feedforward filter,” IEEE Transactions on Communications, vol. 50, pp. 947-955, June 2002, the contents of which are incorporated by reference in their entirety, and H. Witschnig, M. Kemptner, R. Weigel, and A. Springer, “Decision feedback equalization for a single carrier system with frequency domain equalization—an overall system approach,” 1st International Symposium on Wireless Communication Systems, pp. 26-30, 2004, the contents of which are incorporated by reference in their entirety, a hybrid DFE structure is used for Single Carrier Frequency-Domain Equalization (SC-FDE) systems to perform equalization in the frequency-domain. In hybrid DFE, the feedforward filter is implemented in the frequency-domain while the feedback filter is realized in the time-domain. Due to frequency-domain implementation of the feedforward filter, there can be a reduction in complexity in a hybrid DFE a compared to a time-domain counterpart. In G. Huang, A. Nix, and S. Armour, “Decision feedback equalization in SC-FDMA,” 19th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, pp. 1-5, September 2008, the contents of which are incorporated by reference in their entirety, a hybrid DFE is developed for SC-FDMA to be applied for each user individually. In N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, the contents of which are incorporated by reference in their entirety, both the feedforward and feedback filters are implemented in the frequency-domain and an iterative procedure is used to solve the causality problem.

The complexity of a DFE with both filters implemented in the frequency-domain is reduced as compared to the hybrid DFE. In C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, the contents of which are incorporated by reference in their entirety, a framework is proposed to compute the optimum weights of the frequency-domain DFE for SC-FDMA systems and it was shown that its performance is better than the Frequency Domain LE (FD-LE) in frequency selective channels. All of the above mentioned DFE structures are non-adaptive and may require CSI at the receiver.

As equalization in SC-FDMA requires CSI, which is estimated from pilot symbols inserted in each block of data and optimum equalization is used to remove channel effects. Alternatively, a adaptive equalizer can be used which does not rely on CSI estimation and, hence, the channel estimation overhead will be reduced. Moreover, an adaptive equalizer can also help in tracking time varying channels. An adaptive LE for STBC SC-FDE system is developed and extended for multiuser scenarios in W. Younis, A. Sayed, and N. Al-Dhahir, “Efficient adaptive receivers for joint equalization and interference cancellation in multiuser space-time block-coded systems,” IEEE Transactions on Signal Processing, vol. 51, pp. 2849-2862, November 2003, the contents of which are incorporated by reference in their entirety.

Constraint-based algorithms help in achieving the better performance by putting some condition on the behavior of the algorithm. There are several examples of the constraint-based algorithms like normalized LMS, as described in D. Slock, “On the convergence behavior of the LMS and the normalized LMS algorithms,” IEEE Transactions on Signal Processing, vol. 41, no. 9, pp. 2811-2825, 1993, the contents of which are incorporated by reference in their entirety, noise constraint LMS, as described in S. Gelfand and J. Krogmeier, “Noise-constrained least mean squares algorithm,” IEEE Transactions on Signal Processing, vol. 49, no. 9, pp. 1961-1970, 2001, the contents of which are incorporated by reference in their entirety, l₀ norm constraint LMS, as described in Y. Gu, J. Jin, and S. Mei, “lo Norm Constraint LMS Algorithm for Sparse System Identification,” IEEE Signal Processing Letters, vol. 16, pp. 774-777, September 2009, the contents of which are incorporated by reference in their entirety, and noise-constraint diffusion LMS, as described in M. O. Bin Saced, A. Zerguine, and S. A. Zummo, “A noise-constrained algorithm for estimation over distributed networks,” International Journal of Adaptive Control and Signal Processing, pp. 827-845, October 2012, the contents of which are incorporated by reference in their entirety, and the like. In a time-domain DFE, the previous decisions are fed back but not the present not the present decision. To ensure that the present symbol is not canceled out, the feedback filter coefficient corresponding to that symbol is set to zero. In certain embodiments described herein, the frequency-domain version of DFE is used, therefore, to make sure that the present symbol is not canceled out in the feedback loop, and the equalizer is constrained to follow this action in order to enhance performance.

FIG. 1A is a schematic diagram of a communication system, according to certain embodiments. The communication system may include a transmitter 100 and a receiver 104. Data between the transmitter 100 and the receiver 104 is transmitted via communication channel 102. The transmitter 100 may be any electronic subsystem that produces a modulated signal. The receiver 104 obtains input samples via the communication channel 102 and performs equalization on the input samples to obtain block estimates, which are estimates of transmit blocks (transmitted by the transmitter 100). A data block may be also called a packet, a frame, and the like.

FIG. 1B is a schematic diagram that shows a transmission from a wireless device to a base station, according to certain embodiments. FIG. 1B shows a base station 250 and a wireless device 252. In an uplink transmission system, the transmitter 100 is part of the wireless device 252 and the receiver 104 is part of the base station 250. In other embodiments, the transmitter 100 may be part of the base station 250, and the receiver 104 is part of the wireless device 252. The base station 250 is typically a fixed station that communicates with the wireless devices 100. The wireless device 252 may be fixed or mobile. The wireless device 252 may be a cellular phone, a personal digital assistant (PDA), a wireless modem card, or the like. The wireless device 252 may transmit a signal to the base station 250. The signal may be a radio frequency (RF) signal. The signal may reach the base station 250 is one or more signal paths as shown in FIG. 1B. The signals may include a direct path and reflected paths. The reflected paths may be created by reflections from obstacles such as trees, buildings, or other structures. A received RF signal at the base station 250 is a superposition of all received signals at the base station 250.

FIG. 1C is an exemplary block diagram of a transceiver 150 of a SC-FDMA system, according to certain embodiments. The transceiver 150 includes processing circuitry configured to execute the processes described further herein. The transceiver 150 can be applied to localized and interleaved sub-carrier mapping methods. In some implementations, K users and M sub-carriers for each user result in a total of N sub-carriers, according to N=KM. For the m^(th) user, M data symbols are grouped to form a signal block x(m), i.e., x^((m))=[x(0)^((m)); x(1)^((m)); . . . ; x(M−1)^((m))]^(T), where T denotes the transpose operation. An M-point DFT module 152 applies the M-point DFT to the signal block to transform the signal block x^((m)) to the frequency-domain symbol, X^((m))=[X(0)^((m)); X(1)^((m)); . . . ; X(M−1)^((m))]^(T).

At subcarrier mapping stage 154, the frequency-domain symbol X^((m)) is mapped to N sub-carriers with one or more mapping techniques, such that S ^(m) =R ^(m) X ^(m) , m=1,2, . . . ,K  (1) where, R^((m)) can be an N×M resource allocation matrix for the m^(th) user. For a localized mapping schemes, R^((m))=[0_(M×(k−1)M)I_(M)0_(M×(K−k)M)], where I_(M) is M×M identity matrix with columns I₁; I₂; . . . ; I_(M), and 0_(M×M) is M×M all zero matrix. In addition, R^((m)) can be orthogonal for the users, which can be defined by

$R^{{(m)}T}R^{{(l)} =}\left\{ {\begin{matrix} {I_{M},} & {m = l} \\ {0_{M \times M},} & {m \neq l} \end{matrix}.} \right.$

At the N-point Inverse DFT (IDFT) module 156, the block S^((m)) is transformed to the time domain, s^((m)), by applying an N-point IDFT according to the equation S ^((m)) =F _(N) ^(H) R ^((m)) F _(M) x ^((m)) =F _(N) ^(H) R ^((m)) X ^((m))  (2) Where F_(N) is an N×N DFT matrix and H denotes the Hermitian (i.e., complex-conjugate transpose) operation. At cycle prefix (CP) insertion module 158, a CP of length L_(CP) is inserted into the signal, s^((m)). According to some implementations, the CP matrix can be defined a

$\begin{matrix} {C_{1} = \begin{bmatrix} 0_{L_{CP} \times {({N - L_{CP}})}} & I_{L_{CP}} \\ I_{N} & \; \end{bmatrix}} & (3) \end{matrix}$ The transmit signal is formed by converting the signal s_(CP) ^((m)) from parallel to serial, such that s_(CP) ^((m))=C₁s^((m)). The impulse response of the channel for the m^(th) user is denoted by h^((m))=[h₀ ^((m)), h₁ ^((m)), . . . , h_(L(m)) ^((m))]. In order to eliminate inter-symbol interference (ISI) from the cycle prefix, L(m) is shorter in length than L_(CP). The transmit signal can be transmitted across a channel via transmitter 160 and received at receiver 162.

At CP removal module 164, the processing circuitry removes the CP from the received signal. The CP insertion at the transmitter 160 and removal at the receiver 162 can be expressed as a circular convolution between the transmitted signal and the channel vectors. For example, the received signal after CP removal can be expressed as

$\begin{matrix} {y = {{\sum\limits_{m = 1}^{K}{s^{(m)}h^{(m)}}} + n^{(m)}}} & (4) \end{matrix}$ where

denotes the N-point circular convolution operation, and n^((m)) is a noise vector.

An N-point DFT is applied to the received signal at N-point DFT module 166 to transform the received signal into the frequency-domain and can be expressed as

$\begin{matrix} {\overset{\prime}{Y} = {{\sum\limits_{m = 1}^{K}{{\hat{\Lambda}}^{(m)}R^{(m)}X^{(m)}}} + N}} & (5) \end{matrix}$ Where {circumflex over (Λ)}^((m)) is a N×N diagonal matrix including the DFT of h^((m)) as diagonal elements, N is noise vector due to

$\sum\limits_{m = 1}^{K}n^{(m)}$ with variance of σ_(N) ²I_(N).

Then, the signal is provided to a subcarrier de-mapping module 168. After de-mapping, the m^(th) user's received signal can be expressed as Y ^((m)) =R ^((m)T){circumflex over (Λ)}^((m)) R ^((m)) X ^((m)) +N ^((m))  (6) In addition, if Λ^((m))=R^((m)T){circumflex over (Λ)}^((m))R^((m)), then Λ(m) is a M×M diagonal matrix, and simplifying the notation without the loss of generality by omitting the superscript m, equation (6) becomes Y=ΛX+N  (7) The signal is then provided to an equalizer 170. In one implementation, the equalizer 120 can be a linear equalizer (LE), and in other implementations, the equalizer 120 is a decision feedback equalizer (DFE) that includes feedforward and feedback filters. For an AFD-DFE for SISO (Single-Input Single-Output) SC-FDMA systems, let Z=diag(Y) and denote the feedforward and feedback filter coefficients in the frequency-domain as F and B, respectively. The output of the equalizer in the frequency-domain at instant k is given by

_(k) =Z _(k) F _(k−1) +D _(k) B _(k−1)  (8) The coefficients of the feedforward and feedback filters can be expressed as F(0), F(1), . . . , F(M−1) and B(0), B(1), . . . , B(M−1), respectively. Explicit knowledge of the filter coefficients may not be needed for development of the adaptive solution. The decision matrix D_(k) is defined as

$D_{k} = \left\{ \begin{matrix} {{diag}\left( {F_{M}\left( x_{k} \right)} \right)} & {{for}\mspace{14mu}{training}} \\ {{diag}\left( {F_{M}\left( {\overset{)}{x}}_{k} \right)} \right)} & {{for}\mspace{14mu}{decision}\text{-}{directed}} \end{matrix} \right.$ The weight vector of the linear equalizer (LE) can be denoted by

${W_{k} = \begin{bmatrix} F_{k} \\ B_{k} \end{bmatrix}},$ equation (8) can be expressed as

_(k) =[Z _(k) D _(k) ]W _(k−1)  (9) Hence, the output of the equalizer in the time domain is

_(k)=F_(M) ^(H)

_(k). Implementations of the equalizer 170 as an adaptive frequency-domain decision feedback equalizer (AFD-DFE) are discussed further herein.

The output of the equalizer 170 is fed to a M-point IDFT module 172 where the equalized symbols are transformed back to the time domain via IDFT. Detection and decoding can take place in the time domain.

According to certain embodiments, the AFD-DFE for SISO SC-FDMA systems uses a recursive least squares (RLS)-based implementation. In some implementations, a constraint can be applied to formulate a constraint-based least squares solution for DFE taps to be used in SC-FDMA systems, which can be referred to as recursive least squares with constraint (CRLS), which can help improve performance of the system. The constraint can be used to cancel out pre and post cursers but not the desired components of the signal. In the embodiments described herein, the constraint applied is described in N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, the contents of which are incorporated by reference in their entirety. For example, the feedback filter weights in the time-domain may be b₀, b₁, . . . , b_(L). The filter weight b₀ that corresponds to the current symbol can be set to zero so that the current filter weight is not provided as feedback. In frequency-domain, this constraint can be translated to

${\sum\limits_{i = 0}^{M - 1}{B(i)}} = 0.$ In other implementations, the RLS approach to the AFE-DFE can also be applied to SC-FDMA systems without the use of constraints.

Derivations of AFD-DFE algorithms are described further herein, according to certain embodiments. In one embodiment, the frequency-domain RLS AFD-DFE can be developed from a least squares problem, which is referred to throughout the disclosure as Case 1 of the RLS AFD-DFE. For example, k+1 realizations of the matrices {Z, D} are available and collected as

$\begin{matrix} {\Pi = \begin{bmatrix} Z_{0} & D_{0} \\ Z_{1} & D_{1} \\ \vdots & \vdots \\ Z_{k} & D_{k} \end{bmatrix}} & (10) \end{matrix}$ and

$\begin{matrix} {\overset{\prime}{D} = \begin{bmatrix} D_{0} \\ D_{1} \\ \vdots \\ D_{k} \end{bmatrix}} & (11) \end{matrix}$ where [D_(k)(0); D_(k)(1); . . . ; D_(k)(M−1)]^(T) is a vector having the diagonal elements of D_(k). First, a combined cost function of the feedforward and feedback filters is minimized according to

$\begin{matrix} {{\min_{W}{{{\overset{\prime}{D} - {\Pi\; W}}}^{2}\mspace{14mu}{subject}\mspace{14mu}{to}\mspace{14mu}{\sum\limits_{i = 0}^{M - 1}{B(i)}}}} = 0} & (12) \end{matrix}$ where the notation ∥.∥² denotes the squared Euclidean norm of its argument. To solve the optimization problem, the Lagrange multiplier method is used for a case of complex valued data. According to the Lagrange multiplier method, the objective function has two terms, given on the right side of the equation

$\begin{matrix} {J = {{{\overset{\prime}{D} - {\Pi\; W}}}^{2} + {2{Re}\left\{ {\alpha*{\sum\limits_{i = 0}^{M - 1}{B(i)}}} \right\}}}} & (13) \end{matrix}$ where α is complex valued Lagrange multiplier and asterisk denotes complex conjugation. Since the square of Euclidean norm ∥{acute over (D)}−

∥² is a real-valued function the real part operator, Re{.}, is applied to the second term of equation (13) to ensure that the contribution to the objective function is real-valued. The objective function, J, is quadratic, as shown by expanding equation (13) into J={acute over (D)} ^(H) {acute over (D)}−{acute over (D)} ^(H) ΠW−W ^(H)Π^(H) ΠW+2Re{α*GW}  (14) The objective function J is differentiated with respect to the weight vector W to find the least squares solution For example, the real-valued function J is differentiated with respect to the complex-valued vector W to achieve

$\begin{matrix} {\frac{\partial J}{\partial W} = {{{- D}\;\overset{\prime}{\Pi}} + {W^{H}\Pi^{H}\Pi} + {\alpha^{*}G}}} & (15) \end{matrix}$ where G=[0_(1×M) 1_(1×M)] with 0_(1×M) as an all zeros row vector and 1_(1×M) as an all ones row vector of size M. The final solution of equation (15) then becomes W=(Π^(H)Π)⁻¹(Π^(H) {acute over (D)}−

)  (16) Where

=αG^(T).

The least squares solution of equation (16) can be updated recursively, according to certain embodiments. The time updated least squares problem can be represented as

$\begin{matrix} {{\min_{W}{{{{\overset{\prime}{D}}_{k + 1} - {\Pi_{k + 1}W}}}^{2}\mspace{14mu}{subject}\mspace{14mu}{to}\mspace{14mu}{\sum\limits_{i = 0}^{M - 1}{B(i)}}}} = 0} & (17) \end{matrix}$ which has the following solution: W _(k+1)=(Π_(k+1) ^(H)Π_(k+1))⁻¹(Π_(k+1) ^(H) {acute over (D)} _(k+1)−

_(k+1))  (18) To develop an update scheme for the least squares solution (18), Π_(k+1) and {acute over (D)}_(k+1) can be written as

$\begin{matrix} {\Pi_{k + 1} = \begin{bmatrix} \Pi_{k} \\ \Pi_{k + 1} \end{bmatrix}} & (19) \end{matrix}$ and

$\begin{matrix} {{\overset{\prime}{D}}_{k + 1} = \begin{bmatrix} {\overset{\prime}{D}}_{k} \\ {\overset{\prime}{D}}_{k + 1} \end{bmatrix}} & (20) \end{matrix}$ In addition, Π_(k+1) ^(H)Π_(k+1)=Π_(k) ^(H)Π_(k) +[Z _(k+1) D _(k+1)]^(H) [Z _(k+1) D _(k+1)]  (21) and Π_(k+1) ^(H) {acute over (D)} _(k+1)−

_(k+1)=Π_(k) ^(H) {acute over (D)} _(k)−

_(k) +[Z _(k+1) D _(k+1)]^(H) {acute over (D)} _(k+1)−α_(k+1) G ^(T)  (22) Also, the following matrix can be introduced

_(k+1)=(Π_(k+1) ^(H)Π_(k+1))⁻¹  (23) So that equation (18) can be written more compactly as W _(k+1)=

_(k+1)(Π_(k+1) ^(H) {acute over (D)} _(k+1)−

_(k+1))  (24) By substituting equation (21) into equation (23), the result is

_(k+1)=(Π_(k) ^(H)Π_(k) +[Z _(k+1) D _(k+1)]^(H) [Z _(k+1) D _(k+1)])⁻¹  (25) and

_(k+1) ⁻¹=

_(k) ⁻¹ +[Z _(k+1) D _(k+1)]^(H) [Z _(k+1) D _(k+1)]  (26) By applying a matrix inversion identity,

_(k+1)=

_(k)−

_(k) [Z _(k+1) D _(k+1)]^(H) β[Z _(k+1) D _(k+1)]

_(k)  (27) where β=(I_(M)+[Z_(k+1) D_(k+1)]

_(k)[Z_(k+1) D_(k+1)]^(H))⁻¹. Equations (27) and (22) can be substituted into equation (24), resulting in W _(k+1)=(

_(k)−

_(k) [Z _(k+1) D _(k+1)]^(H) β[Z _(k+1) D _(k+1) ]P _(k))(Π_(k) ^(H) {acute over (D)} _(k)−

_(k) +[Z _(k+1) D _(k+1)]^(H) {acute over (D)} _(k+1)−α_(k+1) G ^(T))=

_(k)Π_(k) ^(H) {acute over (D)} _(k)−

_(k)

_(k)−(P _(k) [Z _(k+1) D _(k+1)]^(H) β[Z _(k+1) D _(k+1)]

_(k)(Π_(k) ^(H) {acute over (D)} _(k)−

_(k))+

_(k) [Z _(k+1) D _(k+1)]^(H)(I _(M) −β[Z _(k+1) D _(k+1)]

_(k) [Z _(k+1) D _(k+1)]^(H))D _(k+1)−

_(k+1)(α_(k+1) G ^(T))  (28) After rearranging, equation (28) becomes W _(k+1) =W _(k)(

_(k) [Z _(k+1) D _(k+1)]^(H)β)(D _(k+1) −[Z _(k+1) D _(k+1) ]W _(k))−

_(k+1)(α_(k+1) G ^(T))=W _(k)+

_(k+1) [Z _(k+1) D _(k+1)]^(H)(D _(k+1) −[Z _(k+1) D _(k+1) ]W _(k))−

_(k+1)(α_(k+1) G ^(T))  (29) In addition,

_(k+1) [Z _(k+1) D _(k+1)]^(H)=(

_(k) [Z _(k+1) D _(k+1)]^(H)β)  (30) Finally, the weight vector update equation becomes W _(k+1)=

_(k+1)([Z _(k+1) D _(k+1)]^(H)(D _(k+1) −[Z _(k+1) D _(k+1) ]W _(k))−α_(k+1) G ^(T))  (31) If an exponentially-weighted RLS algorithm is used, then equation (27) becomes

_(k+1)=λ⁻¹[

_(k)−λ⁻¹

_(k) [Z _(k+1) D _(k+1)]^(H) β[Z _(k+1) D _(k+1)]

_(k)]  (32) where 0<<λ<1. The multiplier α_(k+1) can be updated according to a stochastic gradient method as

$\begin{matrix} {\alpha_{k + 1} = {\alpha_{k} + {\mu\left( \frac{\partial J}{\partial\alpha} \right)}_{k}^{*}}} & (33) \end{matrix}$ Differentiating equation (13) with respect to α produces

$\begin{matrix} {\frac{\partial J}{\partial\alpha} = {\left( {\sum\limits_{i = 0}^{M - 1}{B(i)}} \right)^{*}.}} & (34) \end{matrix}$ Therefore, equation (33) can be rewritten as

$\begin{matrix} {\alpha_{k + 1} = {\alpha_{k} + {{\mu\left( {\sum\limits_{i = 0}^{M - 1}{B(i)}} \right)}.}}} & (35) \end{matrix}$

In Case 2 of the RLS AFD-DFE for SISO SC-FDMA, the AFD-DFE can be implemented with the RLS algorithm that is applied with a stochastic gradient method and can be either constrained or unconstrained. For example, the Mean Square Error (MSE) at the i^(th) frequency bin can be represented as MSE(i)=E[|D(i)−{hacek over (X)}(i)|²]  (36) Where E[.] represents the expectation operator. In the constraint-based approach, the coat function becomes

$\begin{matrix} {J = {E\left\lbrack {\left. {{D(i)} - {X^{\prime}\left( i \right.}^{2}} \right\rbrack + {2{{Re}\left\lbrack {\alpha^{*}{\sum\limits_{i = 0}^{M - 1}{B(i)}}} \right\rbrack}}} \right.}} & (37) \end{matrix}$ Expanding the cost function and disregarding the expectation results in

$\begin{matrix} {J = {{{D^{*}(i)}{D(i)}} - {{D^{*}(i)}{y(i)}{\mathcal{F}(i)}} - {{D^{*}(i)}{D(i)}{B(i)}} - {{\mathcal{F}^{*}(i)}{y(i)}{D(i)}} - {{B^{*}(i)}{D^{*}(i)}{D(i)}} + {{\mathcal{F}^{*}(i)}{y^{*}(i)}{y(i)}{\mathcal{F}(i)}} + {{\mathcal{F}^{*}(i)}{y(i)}{D(i)}{B(i)}} + {{B^{*}(i)}{D^{*}(i)}{y(i)}{\mathcal{F}(i)}} + {{B^{*}(i)}{D^{*}(i)}{D(i)}{B(i)}} + {2{{Re}\left\lbrack {\alpha^{*}{\sum\limits_{i = 0}^{M - 1}{B(i)}}} \right\rbrack}}}} & (38) \end{matrix}$ Minimizing equation (38) for the feedforward and feedback filters separately results in the following updates at instant (k+1):

$\begin{matrix} \begin{matrix} {{\mathcal{F}_{k + 1}(i)} = {{\mathcal{F}_{k}(i)} - {{\mu_{k + 1}^{1}(i)}\left( \frac{\partial J}{\partial{\mathcal{F}(i)}} \right)_{k}^{*}}}} \\ {= {{\mathcal{F}_{k}(i)} + {{\mu_{k + 1}^{1}(i)}{y_{k + 1}^{*}(i)}\left\{ {{D_{k + 1}(i)} - \left\lbrack {{{y_{k + 1}(i)}{\mathcal{F}_{k}(i)}} +} \right.} \right.}}} \\ \left. \left. {{D_{k + 1}(i)}{B_{k}(i)}} \right\rbrack \right\} \end{matrix} & (39) \end{matrix}$ and

$\begin{matrix} \begin{matrix} {{B_{k + 1}(i)} = {{B_{k}(i)} - {{\mu_{k + 1}^{2}(i)}\left( \frac{\partial J}{\partial{B(i)}} \right)_{k}^{*}}}} \\ {= {{B_{k}(i)} + {{\mu_{k + 1}^{2}(i)}{D_{k + 1}^{*}(i)}\left\{ {{D_{k + 1}(i)} - \left\lbrack {{{y_{k + 1}(i)}{\mathcal{F}_{k}(i)}} +} \right.} \right.}}} \\ {\left. \left. {{D_{k + 1}(i)}{B_{k}(i)}} \right\rbrack \right\} - {{\mu_{k + 1}^{2}(i)}\alpha_{k + 1}}} \end{matrix} & (40) \end{matrix}$ where μ(i)_(k+1) ¹ and μ(i)_(k+1) ² are the time-varying step-sizes, given as

${{\mu(i)}_{k + 1}^{1}\frac{\mu_{k + 1}}{\epsilon_{k + 1} + {E\left\lbrack {{y(i)}*{y(i)}} \right\rbrack}}},{{\mu(i)}_{k + 1}^{2} = \frac{\mu_{k + 1}}{\epsilon_{k + 1} + {E\left\lbrack {{D^{*}(i)}{D(i)}} \right\rbrack}}}$

For the unconstrained RLS algorithm, the feedforward and feedback filters can be described by

$\begin{matrix} {{F_{k + 1}(i)} = {{F_{k}(i)} + {\left( \frac{\mu_{k + 1}}{\epsilon_{k + 1} + {E\left\lbrack {{y(i)}^{*}{y(i)}} \right\rbrack}} \right){y_{k + 1}^{*}(i)}\left\{ {{D_{k + 1}(i)} - \left\lbrack {{{y_{k + 1}(i)}{F_{k}(i)}} + {{D_{k + 1}(i)}{B_{k}(i)}}} \right\rbrack} \right\}}}} & (41) \end{matrix}$ and

$\begin{matrix} {{B_{k + 1}(i)} = {{B_{k}(i)} + {\left( \frac{\mu_{k + 1}}{\epsilon_{k + 1} + {E\left\lbrack {{D^{*}(i)}{D(i)}} \right\rbrack}} \right){D_{k}^{*}(i)}\left\{ {{D_{k + 1}(i)} - \left\lbrack {{{y_{k + 1}(i)}{F_{k}(i)}} + {{D_{k + 1}(i)}{B_{k}(i)}}} \right\rbrack} \right\}}}} & (42) \end{matrix}$

Next, E[Y(i)*Y(i)] and E[D(i)*D(i)] are replaced by corresponding estimates, which for the RLS update, are chosen as exponentially-weighted sample averages that can be described by

$\begin{matrix} \left. {{E\left\lbrack {{y(i)}^{*}{y(i)}} \right\rbrack} = {\frac{1}{\left( {k + 2} \right)}{\sum\limits_{j = 0}^{k + 1}{\lambda^{k + 1 - j}{y_{j}(i)}^{*}{y_{j}(i)}}}}} \right) & (43) \end{matrix}$ and

$\begin{matrix} \left. {{E\left\lbrack {{D(i)}^{*}{D(i)}} \right\rbrack} = {\frac{1}{\left( {k + 2} \right)}{\sum\limits_{j = 0}^{k + 1}{\lambda^{k + 1 - j}{D_{j}(i)}^{*}{D_{j}(i)}}}}} \right) & (44) \end{matrix}$

For some scalar 0<<λ<1. For example, equations (43) and (44) correspond to an average of all past regressors up to time k+1. The step size can be chosen as μ_(k+1)=1/(k+2) and the regularization factor as ∈_(k+1)=λ^(k+2)∈/(k+2), and collecting the coefficients into

equations (39) and (40) for the constrained RLS algorithm become

$\begin{matrix} {W_{k + 1} = {W_{k} + {\left\lbrack {{\lambda^{k + 2}\epsilon\; I_{2\; M}} + {\sum\limits_{j = 0}^{k + 1}{\lambda^{k + 1 - j}A_{j}^{H}A_{j}}}} \right\rbrack^{- 1}\left( {{A_{k + 1}^{H}ɛ_{k + 1}} - {\alpha_{k + 1}G^{T}}} \right)}}} & (45) \end{matrix}$ where

_(k) and ε_(k) are described by

$\begin{matrix} {A_{k} = \begin{bmatrix} Z_{k} & 0 \\ 0 & D_{k} \end{bmatrix}} & (46) \end{matrix}$ and

$\begin{matrix} {ɛ_{k} = \begin{bmatrix} {D_{k} - {\overset{\Cup}{X}}_{k}} \\ {D_{k} - {\overset{\Cup}{X}}_{k}} \end{bmatrix}} & (47) \end{matrix}$ For the unconstrained RLS algorithm, equations (41) and (42) become

$\begin{matrix} {W_{k + 1} = {W_{k} + {\left\lbrack {{\lambda^{k + 2}\epsilon\; I_{2M}} + {\sum\limits_{j = 0}^{k + 1}{\lambda^{k + 1 - j}A_{j}^{H}A_{j}}}} \right\rbrack^{- 1}A_{k + 1}^{H}ɛ_{k + 1}}}} & (48) \end{matrix}$

Computing the inverse of the matrix in (45) and (48) can be costly due to combining all previous and present data form and then invert the matrix. Therefore, the following equation can be defined

$\begin{matrix} {\theta_{k + 1}\overset{\bigtriangleup}{=}\left( {{\lambda^{k + 2}\epsilon\; I_{2\; M}} + {\sum\limits_{j = 0}^{k + 1}{\lambda^{k + 1 - j}A_{j}^{H}A_{j}}}} \right)} & (49) \end{matrix}$ which satisfies the following recursion Θ_(k+1)=λΘ_(k)+

_(k+1) ^(H)

_(k+1), Θ₀ =εI _(2M)  (50) Let

_(k+1)=Θ_(k+1) ⁻¹ and apply matrix inversion to result in

_(k+1)=λ⁻¹[

_(k)−λ⁻¹

_(k)

_(k+1) ^(H)×(I _(2M)+λ⁻¹

_(k+1)

_(k)

_(k+1) ^(H))⁻¹

_(k+1)

_(k)]  (51) where λ is a forgetting factor chosen to be a value that is close to 1. The RLS update equation for the constrained algorithm is described by

_(k+1)=

_(k)+

_(k+1)(

_(k+1) ^(H)ε_(k+1)−α_(k+1) G ^(T))  (52) In addition, the RLS update equation for the unconstrained algorithm is described by

_(k+1)=

_(k)+

_(k+1)

_(k+1) ^(H)ε_(k+1)  (53) In both (52) and (53), ε_(k) is as defined in equation (47). Initially,

₀=0 and

₀=diag (

I_(M)

I_(M)).

In some implementations,

_(k) in Case 2 of the RLS AFD-DFE algorithm estimates an input covariance matrix for each filter as the input of the feedforward filter and feedback filter, which are taken into account independently. Therefore,

_(k) in the second case of the RLS AFD-DFE algorithm may have better performance than the first case of the RLS AFD-DFE algorithm described previously herein. Also, the decision error for the second case does not affect the feedforward filter. The computational complexity for the second case is also lower than for the first case because matrix inversion is not performed. Details regarding simulation results and computational complexity for Case 1 and Case of the AFD-DFE algorithm are discussed further herein.

According to certain embodiments, due to the structure of

_(k+1) that eliminates the performance of matrix inversion, the computational complexity of the AFD-DFE algorithm can be reduced. In some implementations, the matrix P_(k+1) can have a diagonal structure, i.e., P_(k+1)=diag([P_(k+1) ¹ P_(k+1) ²]), where P_(k+1) ¹ and P_(k+1) ² are also diagonal. P_(k+1) ¹ can be written as P _(k+1) ²=λ¹ [P _(k) ²−λ¹ P _(k) ²(|

_(k+1)|⁻²+λ¹ P _(k) ²)¹ P _(k) ²]  (54) Moreover, the term (I_(M)+λ⁻¹|

_(k+1)|²P_(k) ¹)⁻¹ is also diagonal, therefore, P _(k+1) ¹=λ⁻¹ [P _(k) ¹−λ⁻¹ P _(k) ¹(|

_(k+1)|⁻²+λ⁻¹ P _(k) ¹)⁻¹ P _(k) ¹]  (55) and the matrix inversion has M scalar inversions. Likewise, the determination of P_(k+1) ² follows P_(k+1) ¹ such that

$\begin{matrix} \begin{matrix} {P_{k - 1}^{1} = {\lambda^{- 1}\left\lbrack {P_{k}^{1} - {\lambda^{- 1}P_{k}^{1}{{\mathfrak{Z}}_{k - 1}^{H}\left( {I_{M} + {\lambda^{- 1}{\mathfrak{Z}}_{k + 1}P_{k}^{1}{\mathfrak{Z}}_{k + 1}^{H}}} \right)}^{- 1}{\mathfrak{Z}}_{k + 1}P_{k}^{1}}} \right\rbrack}} \\ {= {\lambda^{- 1}\left\lbrack {P_{k}^{1} - {\lambda^{- 1}P_{k}^{1}{{\mathfrak{Z}}_{k - 1}^{H}\left( {I_{M} + {\lambda^{- 1}{{\mathfrak{Z}}_{k + 1}}^{2}P_{k}^{1}}} \right)}^{- 1}{\mathfrak{Z}}_{k - 1}P_{k}^{1}}} \right\rbrack}} \end{matrix} & (56) \end{matrix}$ Therefore, the final reduced-complexity constrained RLS update has the form

_(k+1)=

_(k)+diag([P _(k+1) ¹ P _(k+1) ²])(

_(k+1) ^(H)ε_(k+1)−α_(k−1) G ^(T))  (57) and the constraint, α_(k+1), is updated based on equation (35) In addition, the reduced-complexity unconstrained RLS update has the form

_(k+1)=

_(k)+diag([P _(k+1) ¹ P _(k+1) ²])(

_(k+1) ^(H)ε_(k+1)  (58)

While the embodiments herein describe the derivation of an AFD-DFE, the derivations can also be applied to a linear equalizer by setting the feedback coefficients equal to zero.

FIG. 2 is an exemplary block diagram of an AFD-DFE 200, according to certain embodiments. For example, the AFD-DFE 200 can be used as the equalizer 170 in the transceiver 150. The AFD-DFE 200 can be implemented in SISO SC-FDMA systems and can employ algorithms such as the Case 1 and Case 2 RLS and CRLS AFD-DFE algorithms described previously herein. At the forming data matrix module 202, the received signal

is formed into diagonal matrix

The AFD-DFE 200 uses a training mode to allow the algorithm to converge before switching to a decision-directed mode where the AFD-DFE 200 uses previous decisions to update the weighting factors of the adaptive feedforward filter 204 and adaptive feedback filter 206. For example, in the training mode, the AFD-DFE 200 receives known M-block training signal, X_(k), which is compared to the received signal after the adaptive feedforward filter 204 and adaptive feedback filter 206 have been applied according to equation (8), denoted by {hacek over (X)}_(k), at the summing module 216. In some implementations, a causality problem that can exist in equation (8) can be solved using an iterative procedure as described in K. Berberidis and P. Karaivazoglou, “An efficient block adaptive decision feedback equalizer implemented in the frequency domain,” IEEE Transactions on Signal Processing, vol. 50, pp. 2273-2285, September 2002 and V. Kekatos, K. Berberidis, and A. A. Rontogiannis, “A Block Adaptive Frequency Domain MIMO DFE for Wideband Channels,” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. III-197-III-200, 2007, the entire contents of which are incorporated by reference in their entirety. The output of the summing module 206 is an error signal that is used to update the coefficients of the adaptive feedforward filter 204 and the adaptive feedback filter 206. The error signal is generated by comparing the equalizer output with a desired response in the frequency-domain, which is applied in equations (57) and (58). In addition, the M-block training signal is used to form the decision matrix,

_(k), at the form decision matrix module 208, which is then used to update the adaptive feedback filter 206. To prevent the algorithm of the AFD-DFE 200 from diverging, such as in the case of fast fading channels, retraining blocks can also be used.

In the decision-directed mode, the AFD-DFE 200 uses the equalizer output from the previous iteration after an M-point DFT is applied at the DFT module 214 to calculate the error signal at the summing module 216. For example, the error signal is based on the difference between the output of the DFT-module 214, {circumflex over (X)}_(k), and the received signal after the adaptive feedforward filter 204 and adaptive feedback filter 206 have been applied, denoted by {hacek over (X)}_(k), according to equation (8). The adaptive feedforward filter 204 and the adaptive feedback filter 206 are updated based on the error signal output from the summing module 216. In addition, the iterated decision, {circumflex over (X)}k, can be used to produce the decision matrix

_(k), at the form decision matrix module 208, which is then used to update the adaptive feedback filter 206. The equalizer output, {hacek over (X)}_(k), is transformed to the time domain using an IDFT at the IDFT module 210, and the output signal from the equalizer 170 is denoted as {circumflex over (x)}_(k).

FIG. 3 is an exemplary flowchart of an adaptation algorithm 300 for the AFD-DFE 200, according to certain embodiments. The adaptation algorithm 300 can be applied to any of the embodiments of the AFD-DFE described herein for SISO and/or MIMO systems. In addition, Table I illustrates a summary of the adaptation algorithm 300. At step S302, the algorithm is initialized according to the following:

₀=0, α₀=0, λ set to a value approximately equal to 1, and

$P_{0} = {\begin{bmatrix} {\epsilon_{F}^{- 1}I_{M}} & 0 \\ 0 & {\epsilon_{B}^{- 1}I_{M}} \end{bmatrix}.}$

TABLE I Summary of the adaptation algorithm for AFD-DFE Initialization: Initialize the algorithm by setting  W₀ = 0  α₀ = 0  λ is close to one   ${{and}\mspace{14mu} P_{0}} = \begin{bmatrix} {\epsilon_{F}^{- 1}I_{M}} & 0 \\ 0 & {\epsilon_{B}^{- 1}I_{M}} \end{bmatrix}$ For each instant of time, k = 0, 1, 2, . . . In training mode: (1) Update α_(k+1) as  α_(k+1)= α_(k) + μΣ_(i=0) ^(M−1)B_(k)(i) (2) Update P_(k+1) ¹ and P_(k+1) ² via  P_(k+1) ¹ = λ⁻¹[P_(k) ¹ − (|Z_(k+1)|⁻² + λ⁻¹P_(k) ¹)⁻¹P_(k) ¹]  P_(k+1) ² = λ⁻¹[P_(k) ² − (|Z_(k+1)|⁻² + λ⁻¹P_(k) ²)⁻¹P_(k) ²] (3) Update the equalizer weights W_(k+1) recursively as  W_(k+1) = W_(k) + diag([P_(k+1) ¹ P_(k+1) ²])(A_(k+1) ^(H)ε_(k+1) − α_(k+1)G^(T)) In decision-directed mode: (1) Iterate on χ̆_(k+1) = Z_(k+1)F_(k) + D_(k+1)B_(k) (2) Update α_(k+1) as  α_(k+1)= α_(k) + μΣ_(i=0) ^(M−1)B_(k)(i) (3) Update P_(k+1) ¹ and P_(k+1) ² via  P_(k+1) ¹ = λ⁻¹[P_(k) ¹ − (|Z_(k+1)|⁻² + λ⁻¹P_(k) ¹)⁻¹P_(k) ¹]  P_(k+1) ² = λ⁻¹[P_(k) ² − (|Z_(k+1)|⁻² + λ⁻¹P_(k) ²)⁻¹P_(k) ²] (4) Update the equalizer weights W_(k+1) recursively as  W_(k+1) = W_(k) + diag([P_(k+1) ¹ P_(k+1) ²])(A_(k+1) ^(H)ε_(k+1) − α_(k+1)G^(T))

At step S304, a determination is made whether the algorithm is in training mode or decision-directed mode. Details regarding the training mode and the decision-directed mode are discussed further herein. If the algorithm is in training mode, resulting in a “yes” at step S304, then step S306 is performed. Otherwise, if the algorithm is in decision-directed mode, resulting in a “no” at step S304, then step S314 is performed.

In training mode, for each instance of time k=0, 1, 2 . . . , at step S306, α_(k+1) is updated according to equation (35). At step S308,

_(k+1) ¹ and

_(k+1) ² are updated according to the following: P _(k+1) ¹=λ⁻¹ [P _(k) ¹−λ⁻¹ P _(k) ¹(|

_(k+1)|⁻²+λ⁻¹ P _(k) ¹)⁻¹ P _(k) ¹] P _(k+1) ²=λ⁻¹ [P _(k) ²−λ⁻¹ P _(k) ²(|X _(k+1)|⁻²+λ⁻¹ P _(k) ²)⁻¹ P _(k) ²] where X_(k+1) represents the training signal.

At step S310, the equalizer weight

_(k+1) can be recursively updated according to equations (52) and/or (53). At step S312, the processing circuitry determines whether the algorithm has reached convergence. In some implementations, convergence is based on magnitude of an error signal comparing the difference between the equalizer output and the desired response. For example, the error signal can correspond to a difference between a training signal and the equalizer output. If the error signal is less than a predetermined threshold, then the processing circuitry can determine that convergence has been achieved. If the determination is made that the algorithm has converged, resulting in a “yes” at step S312, then the algorithm switches to decision-directed mode, and step S314 is performed. Otherwise, if it is determined that convergence has not been reached, resulting in a “no” at step S312, then step S306 is performed for the next instance of time, k.

In decision-directed mode, at step S314, an iteration of the equalizer output is determined according to equation (8). At step S316, α_(k+1) is updated according to equation (35). At step S318,

_(k+1) ¹ and

_(k+1) ² are updated according to the following: P _(k+1) ¹=λ⁻¹ [P _(k) ¹−λ⁻¹ P _(k) ¹(|

_(k+1)|⁻²+λ⁻¹ P _(k) ¹)⁻¹ P _(k) ¹] P _(k+1) ²=λ⁻¹ [P _(k) ²−λ⁻¹ P _(k) ²(|{hacek over (X)} _(k+1)|⁻²+λ⁻¹ P _(k) ²)⁻¹ P _(k) ²] where {hacek over (X)}_(k+1) represents the equalizer output signal.

At step S320, the equalizer weights

_(k+1) can be recursively updated according to equations (52) and/or (53), and the adaptation algorithm returns to step S314 to compute the next iteration of the equalizer output.

In some implementations, the adaptation algorithm 300 can be implemented in a reduced training mode. For example, instead of using M training symbols in one SC-FDMA block, the training overhead can be reduced by using T training symbols and M-T modulated symbols. The remaining weights can be interpolated by introducing an iterative procedure in training mode. Table II illustrates of a summary of the reduced training mode for the adaptation algorithm 300.

According to certain embodiments, perfect frequency synchronization is not assumed between the transmitter and receiver due to carrier frequency offset (CFO) that arises in practical SC-FDMA systems due to misalignment between transmitter/receiver frequency oscillators which causes interference, also referred to as energy leakage, from neighboring sub-carriers.

TABLE II Summary of the reduced-training adaptation algorithm for AFD-DFE For each instant of time, k = 0, 1, 2, . . . In training mode: (1) Iterate on 

_(k+1) =

_(k+1)

_(k) +

_(k+1)

_(k) with SC-FDMA block containing T training symbols and M-T modulated data symbols (2) Update α_(k+1) as   $\alpha_{k + 1} = {\alpha_{k} + {\mu{\sum\limits_{i = 0}^{M - 1}{B_{k}(i)}}}}$ (3) Update P_(k+1) ¹ and P_(k+1) ² via  P_(k+1) ¹ = λ⁻¹[P_(k) ¹ − λ⁻¹ P_(k) ¹(|

_(k+1)|⁻² + λ⁻¹ P_(k) ¹)⁻¹ P_(k) ¹]  P_(k+1) ² = λ⁻¹[P_(k) ² − λ⁻¹ P_(k) ²(| 

_(k+1)|⁻² + λ⁻¹ P_(k) ²)⁻¹ P_(k) ²] (4) Update the equalizer weights

_(k+1) recursively as  

_(k+1) =  

_(k) + diag([P_(k+1) ¹ P_(k+1) ²])(

_(k+1) ^(H)ε_(k+1) − α_(k+1)G^(T))

For example, let the m^(th) user's CFO normalized by the sub-carrier spacing be denoted by Ω_(m) where 0≤Ω_(m)≤0.5. In addition, a diagonal matrix can be defined to characterize the effects of CFO as C ^((m))=diag([e ^(j2πΩ) ^(m) ^(×0/N) e ^(j2πΩ) ^(m) ^(×1/N) . . . e ^(j2πΩ) ^(m) ^(×(N−1)/N)])  (59) In this case, the pre-DFT received signal can be expressed as

$\begin{matrix} {y = {{\sum\limits_{m = 1}^{K}{C^{(m)}\left( {s^{(m)}h^{(m)}} \right)}} + n^{(m)}}} & (60) \end{matrix}$ After applying the N-point DFT, the received signal can be expressed by

$\begin{matrix} {\overset{\prime}{y} = {{\sum\limits_{m = 1}^{K}{C^{(m)}{\hat{\Lambda}}^{(m)}R^{(m)}X^{(m)}}} + N}} & (61) \end{matrix}$ Where C^((m)) is a circulant matrix with entries

${C_{p,q}^{(m)} = {\frac{1}{N}{\sum\limits_{n = 0}^{N - 1}e^{j\; 2\;{\pi{({\Omega^{(m)} + p - q})}}{n/N}}}}},p,{q = 1},\ldots\mspace{14mu},{N.}$

FIG. 4 is an exemplary illustration of the structure of the channel matrix C^((m)){circumflex over (Λ)}^((m)). As shown in FIG. 4, the energy of the matrix is concentrated in three primary diagonals of the channel matrix. In one implementation, an assumption is made that the channel matrix entries other than the three primary diagonals are equal to zero such that the AFD-DFE 200 is a three tap equalizer in the frequency-domain. After de-mapping occurs at the de-mapping module 118, the m^(th) user's received signal is equal to

^((m))=

^((m)T)

To simplify the notation, the superscript m can be ignored, and

In addition, the equalizer tap matrix can have a structure that corresponds to the structure of the channel matrix such that the equalizer tap matrix has non-zero entries on the three primary diagonals. For example, the output of a minimum mean square error (MMSE) 3-tap LE can be represented as

=[

(0),

(1), . . . ,

(M−1)]^(T)

$\begin{matrix} {\begin{bmatrix} {\overset{\Cup}{X}(0)} \\ {\overset{\Cup}{X}(1)} \\ {\overset{\Cup}{X}(2)} \\ \vdots \\ {\overset{\Cup}{X}\left( {M - 1} \right)} \end{bmatrix} = {\quad{\left\lbrack \begin{matrix} {F_{1}(0)} & {F_{2}(0)} & \; & \; & \; \\ {F_{1}(1)} & {F_{2}(1)} & {F_{3}(1)} & \; & \; \\ \; & {F_{1}(2)} & \ddots & \ddots & \; \\ \; & \; & \ddots & \ddots & {F_{3}\left( {M - 2} \right)} \\ \; & \; & \; & {F_{1}\left( {M - 1} \right)} & {F_{2}\left( {M - 1} \right)} \end{matrix} \right\rbrack\left\lbrack \begin{matrix} {y(0)} \\ {y(1)} \\ {y(2)} \\ \vdots \\ {y\left( {M - 1} \right)} \end{matrix} \right\rbrack}}} & (62) \end{matrix}$ where

_(i)(j) represents the tap of the LE. In addition, the output of a DFE can be represented as

$\begin{matrix} {\begin{bmatrix} {\hat{X}(0)} \\ {\hat{X}(1)} \\ {\hat{X}(2)} \\ \vdots \\ {\hat{X}\left( {M - 1} \right)} \end{bmatrix} = {\begin{bmatrix} {\mathcal{F}_{1}(0)} & {\mathcal{F}_{2}(0)} & \; & \; & \; \\ {\mathcal{F}_{1}(1)} & {\mathcal{F}_{2}(1)} & {\mathcal{F}_{3}(1)} & \; & \; \\ \; & {\mathcal{F}_{1}(2)} & \ddots & \ddots & \; \\ \; & \; & \ddots & \ddots & {\mathcal{F}_{3}\left( {M - 2} \right)} \\ \; & \; & \; & {\mathcal{F}_{1}\left( {M - 1} \right)} & {\mathcal{F}_{2}\left( {M - 1} \right)} \end{bmatrix}{\quad{\begin{bmatrix} {y(0)} \\ {y(1)} \\ {y(2)} \\ \vdots \\ {y\left( {M - 1} \right)} \end{bmatrix} + {\begin{bmatrix} {B(0)} & \; & \; & \; & \; \\ \; & {B(1)} & \; & \; & \; \\ \; & \; & {B(2)} & \; & \; \\ \; & \; & \; & \ddots & \; \\ \; & \; & \; & \; & {B\left( {M - 1} \right)} \end{bmatrix}\begin{bmatrix} {D(0)} \\ {D(1)} \\ {D(2)} \\ \vdots \\ {D\left( {M - 1} \right)} \end{bmatrix}}}}}} & (63) \end{matrix}$ where

_(i)(j) and

_(i)(j) represent the tap coefficient of the feedforward and feedback filter, respectively. For an adaptive solution for the AFD-DFE, the result may not be an explicit solution. Denoting

_(i)=[

(i−1)

(i)

(i+1)] for i=1, 2, . . . , M−2,

₀=[[

(0) [

(1)], equation (63) can be written alternatively as

$\begin{matrix} {\begin{bmatrix} {\overset{˘}{X}(0)} \\ {\overset{˘}{X}(1)} \\ {\overset{˘}{X}(2)} \\ \vdots \\ {\overset{˘}{X}\left( {M - 1} \right)} \end{bmatrix} = {{\begin{bmatrix} \mathcal{U}_{0} & \; & \; & \; & \; \\ \; & \mathcal{U}_{1} & \; & \; & \; \\ \; & \; & \mathcal{U}_{2} & \; & \; \\ \; & \; & \; & \ddots & \; \\ \; & \; & \; & \; & \mathcal{U}_{M - 1} \end{bmatrix}\begin{bmatrix} {\mathcal{F}_{1}(0)} \\ {\mathcal{F}_{2}(0)} \\ {\mathcal{F}_{1}(1)} \\ \vdots \\ {\mathcal{F}_{2}\left( {M - 1} \right)} \end{bmatrix}} + {\quad{\begin{bmatrix} {D(0)} & \; & \; & \; & \; \\ \; & {D(1)} & \; & \; & \; \\ \; & \; & {D(2)} & \; & \; \\ \; & \; & \; & \ddots & \; \\ \; & \; & \; & \; & {D\left( {M - 1} \right)} \end{bmatrix}\begin{bmatrix} {B(0)} \\ {B(1)} \\ {B(2)} \\ \vdots \\ {B\left( {M - 1} \right)} \end{bmatrix}}}}} & (64) \end{matrix}$ Equation (64) can be written in compact notation, at instant k, as {hacek over (X)} _(k)=

_(k)

_(k−1)+

_(k)

_(k−1)  (65) Defining

_(k), ε_(k), and

_(k) as in equations (46), (47), and (51), the RLS update can be represented by equation (57).

In addition, the computational complexity of the 3-tap CRLS AFD-DFE is reduced because matrix inversion is not implemented in the AFD-DFE. For example, starting with k=0 and using P₀ ¹=ϵI_(3M−2), P₁ ¹ can be represented by

$\begin{matrix} \begin{matrix} {P_{1}^{1} = {\lambda^{- 1}\left\lbrack {P_{0}^{1} - {\lambda^{- 1}P_{0}^{1}{Z_{1}^{H}\left( {I_{M} + {\lambda^{- 1}Z_{1}P_{0}^{1}Z_{1}^{H}}} \right)}^{- 1}Z_{1}P_{0}^{1}}} \right\rbrack}} \\ {= {\lambda^{- 1}{{{\epsilon^{- 1}I_{{3M} - 2}} - {\lambda^{- 1}\epsilon^{- 1}I_{{3M} - 2}{Z_{1}^{H}\left( {I_{M} + {\lambda^{- 1}\epsilon^{- 1}Z_{1}Z_{1}^{H}}} \right)}^{- 1}}}}}} \\ {{Z_{1}\epsilon^{- 1}I_{{3M} - 2}}} \end{matrix} & (66) \end{matrix}$ Because

₁

₁ ^(H)=diag [|

_(0,1)|² |

_(1,1)|² . . . |

_(M−1,1)|²], solving equation (66) does not necessitate using matrix inversion. Also,

$\begin{matrix} {{{Z_{1}^{H}\left( {I_{M} + {\lambda^{- 1}\epsilon^{- 1}Z_{1}Z_{1}^{H}}} \right)}^{- 1}Z_{1}} = \begin{bmatrix} \phi_{0} & \; & \; & \; \\ \; & \phi_{1} & \mspace{2mu} & \; \\ \; & \; & \ddots & \; \\ \; & \; & \; & \phi_{M - 1} \end{bmatrix}} & (67) \end{matrix}$ whereas the entries ϕ_(i), I=1, . . . , M−2 are 3×3 matrices and ϕ_(i), I=0, . . . , M−1 are 2×2 matrices given by ø_(i)=

_(i,1) ^(H)(1+λ⁻¹ϵ⁻¹|

_(i,1)|²)

_(i,1), i=0, . . . , M−1. Thus,

₁ ¹ can have the following structure:

$\begin{matrix} {P_{1}^{1} = \begin{bmatrix} P_{1,0}^{1} & \; & \; & \; \\ \; & P_{1,1}^{1} & \; & \; \\ \; & \; & \ddots & \; \\ \; & \; & \; & P_{1,{M - 1}}^{1} \end{bmatrix}} & (68) \end{matrix}$ where P_(1,i) ¹=λ⁻¹[ϵ⁻¹I_(d)−λ⁻¹ϵ⁻²ø_(i)], d=2 for i=0, M−1 and d=3 for i=1, . . . , M−2. For k=1, P ₂ ¹=λ⁻¹ [P ₁ ¹−λ⁻¹ P ₁ ¹

₂ ^(H)(I _(M)+λ⁻¹

₂ P ₁ ¹

₂ ^(H))⁻¹

₂ P ₁ ¹]  (69) where

₂P₁ ¹

₂ ^(H)=diag[

_(0,2)P_(1,0) ¹

_(0,2) ^(H)

_(1,2)P_(1,1) ¹

_(1,2) ^(H) . . .

_(M−1,2)P_(1,M−1) ¹

_(M−1,2) ^(H)] and

X_(i,2)

₁ ¹

_(i,2) ^(H) is a scalar quantity, thus the matrix inversion equates to M scalar inversions. For k>1,

_(k) ¹ has a structure that corresponds to the matrix structure for k=1. Also, the method of determining of

_(k) ² corresponds to the method of determining

_(k) ² for a one-tap AFD-DFE for SFBC.

According to certain embodiments, the AFD-DFE design can also be applied to MIMO SC-FDMA systems. MIMO systems can be used to enhance performance (spatial diversity) and/or to increase the data transfer rate (spatial multiplexing) of a data transfer system, such as a transceiver. One type of coding scheme that can be applied to a MIMO system is space-frequency block coding (SFBC).

FIG. 5 is an exemplary illustration of a transceiver 600 for SFBC SC-FDMA system, according to certain embodiments. On technique for achieving spatial diversity is Conventional Space-Frequency Block Coding (C-SFBC), which can be implemented on a block level for the AFD-DFE. An M-point DFT module 152 applies the M-point DFT to a data block, x^((m)), to transform the signal block x^((m)) to the frequency-domain symbol, X^((m))=[X(0)^((m)); X(1)^((m)); . . . ; X(M−1)^((m))]^(T). At the SFBC module 602, the processing circuitry of the transceiver 600 applies the SFBC coding scheme to the data block, which results in X₂ ^((m))=[X(0)^((m)); −X*(1)^((m)); . . . ; X(M−2)^((m)); −X*(M−1)^((m))]^(T), and X₁ ^((m))=[X(1)^((m)); X*(0)^((m)); . . . ; X(M−1)^((m)); X*(M−2)^((m))]^(T), where (.)* denotes the complex-conjugate operation. After mapping is performed at mapping module 154, and the circuitry applies the N-point IDFT at the IDFT module 156, the transmit signals from the two antennas are referred to as s₁ ^((m)) and s₂ ^((m)) which correspond to X₁ ^((m)) and X₂ ^((m)). In addition, CP is inserted into the transmit signals at CP insertion module 158.

The transmit signals can be transmitted across a channel via transmitter 160 and received at receiver 162. At the receiver 610, the circuitry of the transceiver 600 performs CP removal at the CP removal module 164 and applies an N-point DFT at N-point DFT module 166. In addition, at sub-carrier de-mapping module 168, the circuitry de-maps the received signal as discussed with respect to the de-mapping module 118 in transceiver 100.

The received signal for the m^(th) user, after received signal is decoded at the SFBC module 604, can be expressed as

$\begin{matrix} \begin{matrix} {y_{oc} = {\begin{bmatrix} y_{o} \\ y_{e}^{*} \end{bmatrix} = {{\begin{bmatrix} \Lambda_{1o} & \Lambda_{2o} \\ \Lambda_{2c}^{*} & {- \Lambda_{1c}^{*}} \end{bmatrix}\begin{bmatrix} \chi_{o} \\ \chi_{e} \end{bmatrix}} + \begin{bmatrix} N_{o} \\ N_{e}^{*} \end{bmatrix}}}} \\ {\overset{\Delta}{=}{{\Lambda\chi}_{oc} + N_{oc}}} \end{matrix} & (70) \end{matrix}$ where

₀(X₀) and

_(e)(X_(e)) represent the odd and even components, respectively, of the frequency-domain received signal

(X). Also, Λ_(io) and Λ_(ie) are diagonal matrices including odd and even components, respectively, of the frequency-domain channel corresponding to the i^(th) transmit antenna. In some implementations, it can be assumed that Λ_(io)=Λ_(ie) for i=1, 2, such that Λ becomes an Alamouti-like matrix. According to certain embodiments, an Alamouti-type matrix has the form

$\begin{bmatrix} \Delta_{1} & \Delta_{2} \\ \Delta_{1}^{*} & \Delta_{2}^{*} \end{bmatrix}.$ The output from the equalizer 170 can be expressed as

$\begin{matrix} {\begin{bmatrix} {\overset{˘}{\chi}}_{o} \\ {\overset{˘}{\chi}}_{c} \end{bmatrix} = {\left( {{\Lambda^{H}\Lambda} + {\frac{1}{SNR}I_{2M}}} \right)^{1}\lambda^{H}{y_{oc}.}}} & (71) \end{matrix}$ where SNR is the signal-to-noise ratio at the receiver 610. Since {tilde over (Λ)}Λ^(H) has an Alamouti-type structure, then

$\begin{matrix} {\begin{bmatrix} {\overset{˘}{\chi}}_{o} \\ {\overset{˘}{\chi}}_{c} \end{bmatrix} = {\begin{bmatrix} \Phi_{1} & \Phi_{2} \\ \Phi_{2}^{*} & {- \Phi_{1}^{*}} \end{bmatrix}\begin{bmatrix} y_{o} \\ y_{c}^{*} \end{bmatrix}}} & (72) \end{matrix}$ where Φ₁ and Φ₂ are diagonal matrices. Alternatively, equation (72) can be written as

$\begin{matrix} {\begin{bmatrix} {\overset{˘}{\chi}}_{o} \\ {\overset{˘}{\chi}}_{c}^{*} \end{bmatrix} = {\begin{bmatrix} {{diag}\left( y_{o} \right)} & {{diag}\left( y_{c}^{*} \right)} \\ {- {{diag}\left( y_{c} \right)}} & {{diag}\left( y_{o}^{*} \right)} \end{bmatrix}\begin{bmatrix} \Upsilon_{1} \\ \Upsilon_{2} \end{bmatrix}}} & (73) \end{matrix}$ where

₁ and

₂ are vectors that include the diagonal elements of Φ₁ and Φ₂. The equalizer output from a DFE can be expressed as

$\begin{matrix} \begin{matrix} {{\overset{˘}{\chi}}_{oc} = {\begin{bmatrix} {\overset{˘}{\chi}}_{o} \\ {\overset{˘}{\chi}}_{c}^{*} \end{bmatrix} = {{\begin{bmatrix} {{diag}\left( y_{o} \right)} & {{diag}\left( y_{c}^{*} \right)} \\ {- {{diag}\left( y_{c} \right)}} & {{diag}\left( y_{o}^{*} \right)} \end{bmatrix}\begin{bmatrix} \Upsilon_{1} \\ \Upsilon_{2} \end{bmatrix}} +}}} \\ {\begin{bmatrix} {{diag}\left( D_{o} \right)} & 0 \\ 0 & {{diag}\left( D_{c}^{*} \right)} \end{bmatrix}\begin{bmatrix} \Psi_{1} \\ \Psi_{2} \end{bmatrix}} \\ {\overset{\Delta}{=}{{Z\;\mathcal{F}} + {DB}}} \end{matrix} & (74) \end{matrix}$ where

_(o) and

_(e) are X_(o) and X_(e), respectively, for the training mode or frequency-domain decisions on {hacek over (X)}_(o) and {hacek over (X)}_(e), respectively, for the decision-directed mode. The feedforward and feedback filter coefficients in the frequency-domain are

=[F(0) . . . F(M−1)]^(T) and

=[

(0) . . .

(M−1)]^(T) including the elements {

₁,

₂} and {Ψ₁, Ψ₂}, respectively. Moreover, Z is an M×M Alamouti-type matrix including the received symbols, and

is a diagonal matrix including decisions. The coefficients for the feedforward and feedback filters are computed adaptively so that exact solutions may not be required. At the k^(th) instant, the output of the equalizer can be represented as {hacek over (X)} _(oe,k)=

_(k)

_(k−1)+

_(k)

_(k−1)  (75) The AFD-DFE recursions for the constrained algorithm can be represented by equation (52) with G=[0_(1×(3M−2)) 1_(1×M)], and the AFD-DFE recursions for the unconstrained algorithm can be represented by equation (53). The error vector can be represented by

$\begin{matrix} {ɛ_{k} = \begin{bmatrix} {D_{k} - {\overset{˘}{\chi}}_{{oe},k}} \\ {D_{k} - {\overset{\sim}{\chi}}_{{oe},k}} \end{bmatrix}} & (76) \end{matrix}$ where

_(k) denotes the decisions made at the k^(th) instant, e.g.,

${D_{k} = \begin{bmatrix} D_{o,k} \\ D_{e,k}^{*} \end{bmatrix}},$ and

_(k), and

_(k) are as in equations (46) and (51), respectively.

The output of the equalizer 170 is fed to a M-point IDFT module 172 where the equalized symbols are transformed back to the time domain via IDFT.

According to certain embodiments, the RLS update equations (52) and (53) do not necessitate the performance of matrix inversion, which reduces an overall computation complexity of the AFD-DFE. For example, the matrix

_(k+1) has a diagonal structure such that

_(k+1)=diag([

_(k+1) ¹

_(k+1) ²]), where

_(k+1) ¹ and

_(k+1) ² are also diagonal and

_(k+1) ¹ can be represented by

$\begin{matrix} {{P_{k + 1}^{1} = {\lambda^{- 1}\left\lbrack {P_{k}^{1} - {\lambda^{- 1}P_{k}^{1}{Z_{k + 1}^{H}\left( {I_{M} + {\lambda^{- 1}Z_{k + 1}P_{k}^{1}Z_{k + 1}^{H}}} \right)}^{- 1}Z_{k + 1}P_{k}^{1}}} \right\rbrack}}{{Simplifying}\mspace{14mu}{the}\mspace{14mu}{term}\mspace{14mu}\left( {\lambda^{- 1}Z_{k + 1}P_{k}^{1}Z_{k + 1}^{H}} \right)\mspace{14mu}{results}\mspace{14mu}{in}}\begin{matrix} {{\lambda^{- 1}Z_{k + 1}P_{k}^{1}Z_{k + 1}^{H}} = {\lambda^{- 1}P_{k}^{1}Z_{k + 1}Z_{k + 2}^{H}}} \\ {= {\lambda^{- 1}{P_{k}^{1}\begin{bmatrix} {{diag}\left( y_{o,{k + 1}} \right)} & {{diag}\left( y_{e,{k + 1}}^{*} \right)} \\ {- {{diag}\left( y_{e,{k + 1}} \right)}} & {{diag}\left( y_{o,{k + 1}}^{*} \right)} \end{bmatrix}}}} \\ {\begin{bmatrix} {{diag}\left( y_{o,{k + 1}}^{*} \right)} & {- {{diag}\left( y_{e,{k + 1}}^{*} \right)}} \\ {{diag}\left( y_{e,{k + 1}} \right)} & {{diag}\left( y_{o,{k + 1}} \right)} \end{bmatrix}} \\ {= {\lambda^{- 1}{P_{k}^{1}\begin{bmatrix} \begin{matrix} {{{diag}\left( {y_{o,{k + 1}}}^{2} \right)} +} \\ {{diag}\left( {y_{e,{k + 1}}}^{2} \right)} \end{matrix} & 0 \\ 0 & \begin{matrix} {{{diag}\left( {y_{o,{k + 1}}}^{2} \right)} +} \\ {i\left( {y_{e,{k + 1}}}^{2} \right)} \end{matrix} \end{bmatrix}}}} \end{matrix}} & (77) \end{matrix}$

Now

_(k+1) ^(H)(I_(M)+λ⁻¹

_(k+1)P_(k) ¹

_(k+1) ^(H))⁻¹

_(k+1)=diag([ø ø])

ψ_(kω1) ¹, where ϕ is diagonal given as ϕ=(diag(|

_(o,k+1)|²)+diag(|

_(e,k+1)|²))⁻¹+λ⁻¹P_(k) ¹]⁻¹.

_(k+1) ¹ can have the form P _(k−1) ¹=λ⁻¹ [P _(k) ¹−λ⁻¹ P _(k) ¹ψ_(k+1) ¹ P _(k) ¹]  (78) Using the approach discussed previously with respect to the SISO implementation, P _(k−1) ²=λ⁻¹ [P _(k) ²−λ⁻¹ P _(k) ²ψ_(k+1) ² P _(k) ²]  (79) where ψ_(k+1) ²=(|

_(k+1)|⁻²+λ⁻¹P_(k) ²)⁻¹. The RLS AFD-DFE recursion for the constrained algorithm has the form

_(k−1)=

_(k)+diag([P _(k+1) ¹ P _(k−1) ²])(

_(k+1) ^(H)ε_(k−1)−α_(k−1) G ^(T))  (80) The RLS AFD-DFE recursion for the unconstrained algorithm has the form

_(k+1)=

_(k)+diag([P _(k+1) ¹ P _(k+1) ²])(

_(k+1) ^(H)ε_(k+1)  (81) where

_(k+1) ¹ and

_(k+1) ² are defined by equations (78) and (79), respectively.

FIG. 6 is an exemplary block diagram of an AFD-DFE 700 for a transceiver having two-transmit once-receive antenna, according to certain embodiments. For example, the AFD-DFE 700 can be used as the equalizer 170 in the transceiver 600. The AFD-DFE 700 can be implemented in MIMO SC-FDMA systems and can employ algorithms such as the C-SFBC and E-SFBC RLS algorithms described previously herein. At the odd/even expansion module 702, the processing circuitry separates the odd and even components of the received signal,

. At the forming data matrix module 202, the odd component signal

_(o,k) and the even component signal,

_(e,k) are formed into diagonal matrix

_(k).

The AFD-DFE 700 uses a training mode to allow the algorithm to converge before switching to a decision-directed mode where the AFD-DFE 700 uses previous decisions to update the weighting factors of the adaptive feedforward filter 204 and adaptive feedback filter 206. For example, in the training mode, the AFD-DFE 700 receives known M-block training signal, X_(oe,k), which is compared to the received signal after the adaptive feedforward filter 204 and adaptive feedback filter 206 have been applied according to equation (75), denoted by {hacek over (X)}_(oe,k), at the summing module 216. The output of the summing module 206 is an error signal that is used to update the coefficients of the adaptive feedforward filter 204 and the adaptive feedback filter 206. The error signal is generated by comparing the equalizer output with a desired response in the frequency-domain, which is applied in equations (80) and (81). In addition, the M-block training signal is used to form the decision matrix,

_(k), at the form decision matrix module 208, which is then used to update the adaptive feedback filter 206. To prevent the algorithm of the AFD-DFE 700 from diverging, such as in the case of fast fading channels, retraining blocks can also be used.

In the decision-directed mode, the AFD-DFE 700 uses the equalizer output from the previous iteration after an M-point DFT is applied at the DFT module 214, and then rearranged at the rearranging module 706 to calculate the error signal at the summing module 216. For example, the error signal is based on the difference between the output of the rearranging module 706, {circumflex over (X)}_(oe,k), and the received signal after the adaptive feedforward filter 204 and adaptive feedback filter 206 have been applied, denoted by {hacek over (X)}_(oe,k), according to equation (8). The adaptive feedforward filter 204 and the adaptive feedback filter 206 are updated based on the error signal output from the summing module 216. In addition, the iterated decision, {circumflex over (X)}_(oe,k), can be used to produce the decision matrix

_(k), at the form decision matrix module 208, which is then used to update the adaptive feedback filter 206. The equalizer output, {hacek over (X)}_(oe,k), which includes the odd and event components, is rearranged at the rearranging module 704 and is transformed to the time domain using an IDFT at the IDFT module 210. The output signal from the equalizer 170 is denoted as {circumflex over (x)}_(k).

In the presence of CFO and high Doppler, severe ICI from adjacent carriers occurs which can destroy the Alamouti structure and can result in performance degradation. Therefore, B. Narasimhan, N. Al-Dhahir, and H. Minn, “SFBC design tradeoffs for mobile SC-FDMA with application to LTE-advanced,” IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3458-3461, 2010, of which the entire contents are incorporated by reference in their entirety, proposes embedded SFBC (E-SFBC) which preserves the Alamouti structure even when ICI is present. In addition, E-SFBC does not affect the low PAPR property of SC-FDMA, which may not exist in C-SFBC. In some embodiments, the AFD-DFE can be implemented with E-SFBC at the block level. In E-SFBC, pilots may not be used, and the blocks may not be divided into two signals. In E-SFBC, X₁ ^((m))=[X(0)^((m)), X(2)^((m)), . . . , X(M−2)^((m))]^(T) and X₂ ^((m))=[X(1)^((m)), X(3)^((m)), . . . , X(M−1)^((m))]^(T) such that X^((m)) is divided into two blocks. In some implementations, the sequence to transmit the sub-blocks is

${\overset{\prime}{X}}_{1} = \begin{bmatrix} X_{1}^{(m)} \\ {- X_{2}^{*{(m)}}} \end{bmatrix}$ and

${\overset{\prime}{X}}_{2} = \begin{bmatrix} X_{2}^{(m)} \\ {- X_{1}^{*{(m)}}} \end{bmatrix}$ for a first antenna and a second antenna, respectively. After mapping and applying N-point IDFT, the transmitted signals from the two antennas are s₁ ^((m)) and s₂ ^((m)) which correspond to X₁ ^((m)) and X₂ ^((m)). The transmitted signals can be circularly convolved with their respective channels, and the received signal, after applying the N-DFT becomes

$\begin{matrix} {\overset{\prime}{y} = {{\sum\limits_{m = 1}^{K}\left\{ {{{\hat{\Lambda}}_{1}^{(m)}R^{(m)}{\overset{\prime}{\chi}}_{1}} + {{\hat{\Lambda}}_{2}^{(m)}R^{(m)}{\overset{\prime}{\chi}}_{2}}} \right)} + \mathcal{N}}} & (82) \end{matrix}$ where {circumflex over (Λ)}^((m)) is a N×N diagonal matrix, {circumflex over (Λ)}^((m))=diag(DFT(h_(i) ^((m))) for i=1, 2, and

is noise component with variance σ

I_(N). The received signal for the m^(th) user, after de-mapping, can be expressed as

^((m)) =R ^((m)T){circumflex over (Λ)}₁ ^((m)) R ^((m)) [X ₁ ^((m)) −X ₂*^((m))]^(T) +R ^((m)T){circumflex over (Λ)}₂ ^((m)) R ^((m)) [X ₂ ^((m)) X ₁*^((m))]^(T)+

^((m))  (83)

If Λ_(i) ^((m))=

^((m)T){circumflex over (Λ)}_(i) ^((m))

^((m)) for i=1, 2, then Λ_(i) ^((m)) is a M×M diagonal matrix. To simplify the notation, the superscript m can be removed, and equation (83) can be written as

₁=Λ₁₁ X ₁+Λ₁₂ X ₂+

₁  (84)

₂=Λ₂₁ X ₁*−Λ₂₂ X ₂*+

₂  (85) where Λ₁=diag[Λ₁₁ Λ₂₂] and Λ₂=diag[Λ₁₂ Λ₂₁]. Combining equations (84) and (85) results in

$\begin{matrix} \begin{matrix} {y = \begin{bmatrix} y_{1} \\ y_{2}^{*} \end{bmatrix}} \\ {= {{\begin{bmatrix} \Lambda_{11} & \Lambda_{12} \\ \Lambda_{21}^{*} & {- \Lambda_{22}^{*}} \end{bmatrix}\begin{bmatrix} \chi_{1} \\ \chi_{2} \end{bmatrix}} + \begin{bmatrix} \mathcal{N}_{1} \\ \mathcal{N}_{2}^{*} \end{bmatrix}}} \\ {\overset{\Delta}{=}{{\Lambda\chi}_{12} + \mathcal{N}_{12}}} \end{matrix} & (86) \end{matrix}$ For Alamouti structure, Λ₁₁=Λ₂₂ and Λ₁₂=Λ₂₁. In addition, the sub-carriers can be reordered before mapping at the transmitter as O{acute over (X)}₁ and O{acute over (X)}₂ where O=[I₁, I₁, I₁, I₁, . . . , I₁, I₁], and it may be assumed that the channel does not change over two consecutive sub-carriers. At the receiver, the reordering is done after de-mapping by using a matrix O^(T). The remaining steps performed at the receiver at the equalization module 620 and M-point IDFT module 622 correspond to the steps performed for the C-SFBC signals.

Due to carrier frequency offset (CFO) in SFBC SC-FDMA signals, Λ_(ij) in equation (86) can lose the diagonal structure. The matrices of equation (86) can be approximated as a banded (tri-diagonal) structure. It can also be assumed, in some implementations, that the structure of the feedforward matrices corresponds to the structure of the channel matrices, and the equalized signal can be represented as

$\begin{matrix} {\begin{bmatrix} {\overset{˘}{\chi}}_{1} \\ {\overset{˘}{\chi}}_{2} \end{bmatrix} = {{\begin{bmatrix} \Phi_{1} & \Phi_{2} \\ \Phi_{2}^{*} & {- \Phi_{1}^{*}} \end{bmatrix}\begin{bmatrix} y_{1} \\ y_{2}^{*} \end{bmatrix}} + {\begin{bmatrix} \Theta_{1} & 0 \\ 0 & \Theta_{2} \end{bmatrix}\begin{bmatrix} D_{1} \\ D_{2} \end{bmatrix}}}} & (87) \end{matrix}$ where Φ_(i) is a tri-diagonal matrix and Θ_(i) is a diagonal matrix.

₁ and

₂ are X₁ and X₂, respectively, for the training mode or frequency-domain decisions on {hacek over (X)}₁ and {hacek over (X)}₂, respectively, for the decision-directed mode. According to certain embodiments, exact knowledge of the equalizer tap matrices may not be required to obtain an adaptive solution. If

_(i)=[

(i−2)

(i)

(i+2)] for i=2, . . . , M−3,

_(i)=[

(i)

(i+2)] for i=0,1, and

_(i)=[

(i−2)

(i)] for i=M−1, M−2, then equation (87) can be expressed as

$\begin{matrix} \begin{matrix} {\overset{˘}{\chi} = {\begin{bmatrix} {\overset{˘}{\chi}}_{1} \\ {\overset{˘}{\chi}}_{2} \end{bmatrix} = {{\begin{bmatrix} Z_{0} & Z_{1}^{*} \\ {- Z_{1}} & Z_{0}^{*} \end{bmatrix}\begin{bmatrix} Y_{1} \\ Y_{2} \end{bmatrix}} + {\begin{bmatrix} {{diag}\left( D_{1} \right)} & 0 \\ 0 & {{diag}\left( D_{2}^{*} \right)} \end{bmatrix}\begin{bmatrix} \Psi_{1} \\ \Psi_{2} \end{bmatrix}}}}} \\ {\overset{\Delta}{=}{{Z\;\mathcal{F}} + {DB}}} \end{matrix} & (88) \end{matrix}$ Where

_(j)=diag[

_(j),

_(j+2), . . . ,

_(j+M−2)] for j=0, 1.

₁ and

₂ (Ψ₁ and Ψ₂) are the vectors having diagonal elements Φ₁ and Φ₂ (Θ₁ and Θ₂). In addition, the feedforward and feedback filter coefficients in the frequency-domain are

and

and have elements {

₁,

₂} and {Ψ₁, Ψ₂}, respectively. Defining

_(k) as in equation (46), ε_(k) as in equation (47), and

_(k) as in equation (51), the RLS update is equal to equation (57) for the constrained case with G=[0_(1×(3M−2)) 1_(1×M)] and equation (58) for the unconstrained case.

Due to the structure of the SFBC matrix, matrix inversion can be avoided, and the complexity of determining the equalizer output is reduced. Starting with k=0 and using

$\begin{matrix} {{P_{0}^{1} = {\epsilon^{- 1}I_{{3M} - 4}}},{P_{1}^{1}\mspace{14mu}{can}\mspace{14mu}{be}\mspace{14mu}{represented}\mspace{14mu}{by}}} & \; \\ {\begin{matrix} {P_{1}^{1} = {\lambda^{- 1}\left\lbrack {P_{0}^{1} - {\lambda^{- 1}P_{0}^{1}{{\overset{-}{Z}}_{1}^{H}\left( {I_{M} + {\lambda^{- 1}{\overset{-}{Z}}_{1}P_{0}^{1}{\overset{-}{Z}}_{1}^{H}}} \right)}^{- 1}{\overset{-}{Z}}_{1}P_{0}^{1}}} \right\rbrack}} \\ {= {\lambda^{- 1}\left\lbrack {{\epsilon^{- 1}I_{{3M} - 4}} - {\lambda^{- 1}\epsilon^{- 1}I_{{3M} - 4}{{\overset{-}{Z}}_{1}^{H}\left( {I_{M} + {\lambda^{- 1}\epsilon^{- 1}{\overset{-}{Z}}_{1}{\overset{-}{Z}}_{1}^{H}}} \right)}^{- 1}}} \right.}} \\ \left. {{\overset{-}{Z}}_{1}\epsilon^{- 1}I_{{3M} - 4}} \right\rbrack \end{matrix}{{Now},}} & (89) \\ {{{\overset{-}{Z}}_{1}{\overset{-}{Z}}_{1}^{H}} = \begin{bmatrix} {{Z_{0}Z_{0,1}^{H}} + {Z_{1,1}^{*}Z_{1,1}^{T}}} & {{{- Z_{0,1}}Z_{1,1}^{H}} + {Z_{1,1}^{*}Z_{0,1}^{T}}} \\ {{{- Z_{1,1}}Z_{0,1}^{H}} + {Z_{0,1}^{*}Z_{1,1}^{T}}} & {{{- Z_{1,1}}Z_{1,1}^{H}} + {Z_{0,1}^{*}Z_{0,1}}} \end{bmatrix}} & (90) \end{matrix}$ In addition, Z₀Z₀ ^(H)+Z₁*Z₁ ^(T)=diag[|

_(0,1)|²|

_(1,1)|²|

_(2,1)|²+|

_(3,1)|² . . . |

_(M−2,1)|²+|

_(M−1,1)|²]s a diagonal matrix as well as the other matrix entries in equation (90). Therefore,

₁

₁ ^(H) is a M×M matrix having four

$\frac{M}{2} \times \frac{M}{2}$ diagonal matrices, which allows the inverse of equation (89) to be computed. Now,

${{Z_{1}^{H}\left( {I_{M} + {\lambda^{- 1}\epsilon^{- 1}Z_{1}Z_{1}^{H}}} \right)}^{- 1}Z_{1}} = {\begin{bmatrix} \varnothing_{0} & \varnothing_{1} \\ \varnothing_{2} & \varnothing_{3} \end{bmatrix} = \varnothing}$ and

$\begin{matrix} {\varnothing_{i} = \begin{bmatrix} \varnothing_{i,0} & \; & \; & \; \\ \; & \varnothing_{i,1} & \; & \; \\ \; & \; & \ddots & \; \\ \; & \; & \; & \varnothing_{i,\frac{M}{2}} \end{bmatrix}} & (91) \end{matrix}$ The entries

$\varnothing_{i,j},{j = 1},\ldots\mspace{14mu},{\frac{M}{2} - 1}$ are 3×3 matrices and

$\varnothing_{i},{i = 0},\frac{M}{2}$ are 2×2 matrices.

₁ ¹ has a structure that corresponds to the structure of Ø such that

$P_{1}^{1} = {\begin{bmatrix} P_{1,0}^{1} & P_{1,1}^{1} \\ P_{1,2}^{1} & P_{1,3}^{1} \end{bmatrix}.}$ For k=1, P ₂ ¹=λ⁻¹ [P ₁ ¹−λ⁻¹ P ₁ ¹

₂ ^(H)(I _(M)+λ⁻¹

₂ P ₁ ¹

₂ ^(H))⁻¹

₂ P ₁ ¹]  (92) Let

${{\overset{-}{Z}}_{2}P_{1}^{1}{\overset{-}{Z}}_{2}^{H}} = {{\begin{bmatrix} \varphi_{0} & \varphi_{1} \\ \varphi_{2} & \varphi_{3} \end{bmatrix}\mspace{14mu}{where}\mspace{14mu}\varphi_{0}} = {{\left( {{Z_{0}P_{1,0}^{1}} + {Z_{1}^{*}P_{1,2}^{1}}} \right)Z_{0}^{H}} + {\left( {{Z_{0}P_{1,1}^{1}} + {Z_{1}^{*}P_{1,3}^{1}}} \right)Z_{1}^{T}}}}$ is a diagonal matrix in addition to the other entries of the matrix. Therefore, the inverse of equation (92) can be determined by using block matrix inversion as described by T. Kailath, Linear Systems. Englewood Cliffs, N.J.: Prentice Hall, 1980, the entire contents of which is herein incorporated by reference, as follows

$\begin{matrix} {\begin{bmatrix} A & B \\ C & D \end{bmatrix}^{- 1} = \begin{bmatrix} \left( {A - {{BD}^{- 1}C}} \right)^{- 1} & {{- \left( {A - {{BD}^{- 1}C}} \right)^{- 1}}{BD}^{- 1}} \\ {{- D^{- 1}}{C\left( {A - {{BD}^{- 1}C}} \right)}^{- 1}} & \left( {D - {{CA}^{- 1}B}} \right)^{- 1} \end{bmatrix}} & (93) \end{matrix}$ where all sub-matrices A, B, C, and D are diagonal, therefore, the inversion of the sub-matrices A, B, C, and D become scalar inversions. For k>1,

_(k) ¹ has a structure that corresponds to the matrix structure for k=1. Also, the method of determining of

_(k) ² corresponds to the method of determining

_(k) ² for a one-tap AFD-DFE for SFBC.

In order to increase the number of users supported by the system, spatially-multiplexed (SM) SC-FDMA can be used. In SM SC-FDMA, multiple users use the same frequency and times slot to transmit data. The number antennas at the base station (BS) can be equal to the number of users using the same frequency and time slots, and the AFD-DFE is used to perform joint interference cancellation and equalization at the receiver. For example, let N_(t) be the number of transmit antennas and N_(r) be the number of receive antennas. Since the number of receive antennas may be assumed to be equal to the number of users who are transmitting with the same frequency band and time slot, then N_(t)=N_(r), according to certain embodiments.

_(i) represents the frequency-domain received signal after de-mapping at the i^(th) antenna as described by M. A. Ruder, U. L. Dang, and W. H. Gerstacker, “User Pairing for Multiuser SC-FDMA Transmission over Virtual MIMO ISI Channels,” IEEE Global Telecommunications Conference (GLOBECOM), pp. 1-7, November 2009, the entire contents of which is incorporated by reference in its entirety. Collecting the received SC-FDMA symbols into one vector produces

$\begin{matrix} \begin{matrix} {\begin{bmatrix} y_{1} \\ y_{2} \\ \vdots \\ y_{N_{r}} \end{bmatrix} = {{\begin{bmatrix} \Lambda_{11} & \Lambda_{21} & \ldots & \Lambda_{N_{r}1} \\ \Lambda_{21} & \Lambda_{22} & \ldots & \Lambda_{N_{r}2} \\ \vdots & \vdots & \ddots & \vdots \\ \Lambda_{1N_{r}} & \Lambda_{2N_{r}} & \ldots & \Lambda_{N_{r}N_{r}} \end{bmatrix}\begin{bmatrix} \chi_{1} \\ \chi_{2} \\ \vdots \\ \chi_{N_{r}} \end{bmatrix}} + \begin{bmatrix} \mathcal{N}_{1} \\ \mathcal{N}_{2} \\ \vdots \\ \mathcal{N}_{N_{r}} \end{bmatrix}}} \\ {\overset{\Delta}{=}{{\Lambda\chi} + \mathcal{N}}} \end{matrix} & (94) \end{matrix}$ where Λ_(ij) is the frequency-domain channel matrix from the i^(th) user's transmit antenna to the ^(jth) receive antenna having diagonal structure. The feedforward and feedback tap matrices also have a diagonal structure that corresponds to the structure of the channel matrix Λ. The output of the AFD-DFE is represented by

$\begin{matrix} {\begin{bmatrix} {\overset{\Cup}{\chi}}_{1} \\ {\overset{\Cup}{\chi}}_{2} \\ \vdots \\ {\overset{\Cup}{\chi}}_{N_{r}} \end{bmatrix} = {{\begin{bmatrix} \mathcal{F}_{1,1} & \mathcal{F}_{1,2} & \ldots & \mathcal{F}_{1,N_{r}} \\ \mathcal{F}_{2,1} & \mathcal{F}_{2,2} & \ldots & \mathcal{F}_{2,N_{r}} \\ \vdots & \vdots & \ddots & \vdots \\ \mathcal{F}_{N_{r},1} & \mathcal{F}_{N_{r},2} & \ldots & \mathcal{F}_{N_{r},N_{r}} \end{bmatrix}\left\lbrack \begin{matrix} y_{1} \\ y_{2} \\ \vdots \\ y_{N_{r}} \end{matrix} \right\rbrack} + {\quad{\begin{bmatrix} B_{1,1} & B_{1,2} & \ldots & B_{1,N_{r}} \\ B_{2,1} & B_{2,2} & \ldots & B_{2,N_{r}} \\ \vdots & \vdots & \ddots & \vdots \\ B_{N_{r},1} & B_{N_{r},2} & \ldots & B_{N_{r},N_{r}} \end{bmatrix}\begin{bmatrix} D_{1} \\ D_{2} \\ \vdots \\ D_{N_{r}} \end{bmatrix}}}}} & (95) \end{matrix}$ where the elements,

_(i,j) and

_(i,j) are diagonal matrices that include the feedforward and feedback coefficients as diagonal elements, respectively. Next, an adaptive solution for entries of the matrices from equation (95) is determined so that the high cost and rate overhead associated with accurate multi-user channel estimation can be reduced and/or avoided. The desired response

_(i) can be defined as

$D_{i} = \left\{ \begin{matrix} {{F_{M}\left( x_{i} \right)},} & {{for}\mspace{14mu}{training}\mspace{14mu}{mode}} \\ {{F_{M}\left( {\hat{x}}_{i} \right)},} & {{for}\mspace{14mu}{decision}\text{-}{directed}\mspace{14mu}{mode}} \end{matrix} \right.$ Defining

_(i)=diag(

_(i)) and the decision matrix

_(i)=diag(D_(i)), equation (95) becomes

$\begin{matrix} {\begin{bmatrix} {\overset{\Cup}{\chi}}_{1} \\ {\overset{\Cup}{\chi}}_{2} \\ \vdots \\ {\overset{\Cup}{\chi}}_{N_{r}} \end{bmatrix} = {\sum\limits_{i = 1}^{N_{r}}\left( {{\begin{bmatrix} Z_{i} & \; & \; \\ \; & \ddots & \; \\ \; & \; & Z_{i} \end{bmatrix}\begin{bmatrix} \mathcal{F}_{i}^{1} \\ \vdots \\ \mathcal{F}_{i}^{N_{r}} \end{bmatrix}} + {\begin{bmatrix} D_{i} & \; & \; \\ \; & \ddots & \; \\ \; & \; & D_{i} \end{bmatrix}\begin{bmatrix} B_{i}^{1} \\ \vdots \\ B_{i}^{N_{r}} \end{bmatrix}}} \right)}} & (96) \end{matrix}$ where

_(i) ^(j) and

_(i) ^(j) are the vectors having the diagonal elements of

_(j,i) and

_(j,i), respectively. If {hacek over (X)}=[{hacek over (X)}₁ ^(T) . . . {hacek over (X)}_(N) _(r) ^(T)]^(T),

_(i)=[

₁ ^(iT) . . .

_(N) _(r) ^(iT)]^(T),

_(i)=[

₁ ^(iT) . . .

_(N) _(r) ^(iT)]^(T) and

_(i)(

_(i)) is the diagonal matrix having the elements on the diagonal, then equation (96) can be expressed as

$\begin{matrix} {\overset{\Cup}{\chi} = {{\sum\limits_{i = 1}^{N_{r}}\left\lbrack {{{\overset{\prime}{Z}}_{i}\mathcal{F}_{i}} + {{\overset{\prime}{D}}_{i}B_{i}}} \right\rbrack}\overset{\bigtriangleup}{=}{{Z\;\mathcal{F}} + {D\; B}}}} & (97) \end{matrix}$ where

=[

₁ . . .

_(N) _(r) ],

=[

₁ ^(T) . . .

_(N) _(r) ^(T)]^(T),

=[

₁ ^(T) . . .

_(N) _(r) ^(T)]^(T) and

=[

₁ . . .

_(N) _(r) ]. The output of the equalizer at instant k can be represented as {hacek over (X)} _(k)=

_(k)

_(k−1)+

_(k)

_(k−1)  (98) Defining the desired response vector as D_(k)=[D_(1,k) ^(T) . . . N_(N) _(r,k) ^(T)]^(T), then

_(k) can be represented by equation (58),

_(k) as in equation (46), and ε_(k) as in equation (47).

In addition, the complexity of determining the equalizer output can be reduced based on the structure of the matrices used in the equalizer output determination. For example, starting with k=0 and using P₀ ¹=ϵ⁻¹I_((N) _(r) ₎ ₂ _(M), P₁ ¹ can be represented by

$\begin{matrix} \begin{matrix} {P_{1}^{1} = {\lambda^{- 1}\left\lbrack {P_{0}^{1} - {\lambda^{- 1}P_{0}^{1}{Z_{1}^{H}\left( {I_{N_{r}M} + {\lambda^{- 1}Z_{1}P_{0}^{1}Z_{1}^{H}}} \right)}^{- 1}Z_{1}P_{0}^{1}}} \right\rbrack}} \\ {= {\lambda^{- 1}\left\lbrack {{\epsilon^{- 1}I_{{(N_{r})}^{2}M}} -} \right.}} \\ \left. {\lambda^{- 1}\epsilon^{- 1}I_{{(N_{r})}^{2}M}{Z_{I}^{H}\left( {I_{N_{r}M} + {\lambda^{- 1}\epsilon^{- 1}Z_{1}Z_{1}^{H}}} \right)}^{- 1}Z_{1}\epsilon^{- 1}I_{{(N_{r})}^{2}M}} \right\rbrack \end{matrix} & (99) \end{matrix}$ In addition, it can be shown that

₁

₁ ^(H)=[|

_(1,1)|²+|

_(2,1)|²+ . . . |

_(N) _(r) _(,1)|²]. Since |

_(i,1)|² has a diagonal structure, |

₁|² is a diagonal matrix, and so computing equation (99) does not include performing matrix inversion, which can be a costly computational operation. Let

₁ ^(H)(I_(N) _(r) _(M)+λ⁻¹ϵ⁻¹

₁

₁ ^(H))⁻¹

₁=ø be a (N_(r) ²M×N_(r) ²M) block matrix, where each sub-matrix Ø_(i,j) is a diagonal matrix given by ø_(i,j)=

_(i,1) ^(H)(I_(N) _(r) _(M)+λ⁻¹ϵ⁻¹

₁

₁ ^(H))⁻¹

_(j,1) Therefore, the structure of

₁ ¹ corresponds to these structure of Ø where each sub-matrix P_(1,(i,j))=λ⁻¹[ϵ⁻¹I_(N) _(r) _(M)−λ⁻¹ϵ⁻²I_(N) _(r) _(M)ø_(i,j)] is also a diagonal matrix.

For k=1, P ₂ ¹=λ⁻¹ [P ₁ ¹−λ⁻¹ P ₁ ¹

₂ ^(H)(I _(N) _(r) _(M)+λ⁻¹

₂ P ₁ ¹

₂ ^(H))⁻¹

₂ P ₁ ¹]  (100) where

${{\mathfrak{Z}}_{2}P_{1}^{1}{\mathfrak{Z}}_{2}^{H}} = {\sum\limits_{j = 1}^{N_{r}}{{\overset{\prime}{\mathfrak{Z}}}_{j,2}\left( {\sum\limits_{i = 1}^{N_{r}}{{\overset{\prime}{\mathfrak{Z}}}_{i,2}P_{1,{({i,j})}}^{1}}} \right)}}$ is a diagonal matrix. Therefore, the matrix inversion becomes N_(r) ²M scalar inversions. For k>1, the structure of

_(k) ¹ corresponds to the structure of

₁ ¹. In addition, the computation of

_(k) ² follows a process that corresponds to the computation of

_(k) ¹ described herein.

FIG. 7 is an exemplary block diagram of a transceiver 800 for a hybrid SFBC and SM SC-FDMA system, according to certain embodiments. The hybrid SFBC system can include SFBC and SM SC-FDMA coding techniques to signals. The transceiver 800 includes the functionality of SFBC and SM SC-FDMA techniques so that both capacity and performance of the transceiver 800 can be increased. The transceiver 800 has N_(r) signals that are input into an M-point DFT module 152 that applies the M-point DFT to a data block, x^((m)), to transform the signal block x^((m)) to the frequency-domain symbol, X^((m))=[X(0)^((m)); X(1)^((m)); . . . ; X(M−1)^((m))]^(T). At the SFBC module 602, the processing circuitry of the transceiver 600 applies the SFBC coding scheme to the data block, which results in X₂ ^((m))=[X(0)^((m)); −X*(1)^((m)); . . . ; X(M−2)^((m)); −X*(M−1)^((m))]^(T), and X₁ ^((m))=[X(1)^((m)); X*(0)^((m)); . . . ; X(M−1)^((m)); X*(M−2)^((m))]^(T), where (.)* denotes the complex-conjugate operation. After mapping is performed at mapping module 154, and the circuitry applies the N-point IDFT at the IDFT module 156, the transmit signals from the two antennas are referred to as s₁ ^((m)) and s₂ ^((m)), which correspond to X₁ ^((m)) and X₂ ^((m)). In addition, CP is inserted into the transmit signals at CP insertion module 158. The received signal for the m^(th) user, after received signal is decoded at the SFBC module 604.

The transmit signals can be transmitted across a channel via transmitter 160 and received at receiver 162. At the receiver 610, the circuitry of the transceiver 600 performs CP removal at the CP removal module 164 and applies an N-point DFT at N-point DFT module 166. In addition, at sub-carrier de-mapping module 168, the circuitry de-maps the received signal as discussed with respect to the de-mapping module 118 in transceiver 100.

The transceiver 800 includes 2N_(t) transmit antennas 160 for N_(r) receive antennas 162. Collecting the received SC-FDMA symbols into one vector produces

$\begin{matrix} \begin{matrix} {\begin{bmatrix} y_{1_{oc}} \\ y_{2_{oc}} \\ \vdots \\ y_{N_{r_{oc}}} \end{bmatrix} = {{\begin{bmatrix} \Lambda_{11} & \Lambda_{21} & \ldots & \Lambda_{N_{r}1} \\ \Lambda_{12} & \Lambda_{22} & \ldots & \Lambda_{N_{r}2} \\ \vdots & \vdots & \ddots & \vdots \\ \Lambda_{1\; N_{r}} & \Lambda_{2\; N_{r}} & \ldots & \Lambda_{N_{r}N_{r}} \end{bmatrix}\begin{bmatrix} \chi_{1_{oc}} \\ \chi_{2_{oc}} \\ \vdots \\ \chi_{N_{r_{oc}}} \end{bmatrix}} + \begin{bmatrix} N_{1_{oc}} \\ N_{2_{oc}} \\ \vdots \\ N_{N_{r_{oc}}} \end{bmatrix}}} \\ {\overset{\bigtriangleup}{=}{{\Lambda\;\chi_{oc}} + N_{oc}}} \end{matrix} & (101) \end{matrix}$ where Λ_(ij) is the frequency-domain channel matrix from the i^(th) user's transmit antenna 160 to the ^(jth) receive antenna 162 having an Alamouti-like structure, and

_(i) _(oe) is the received vector after SFBC at the i^(th) antenna. Since N_(t)=2×N_(r), and the feedforward coefficient matrices can have a structure that corresponds to the structure of the channel matrix, Λ, the output of the equalizer can be represented by

$\begin{matrix} {\begin{bmatrix} {\overset{\Cup}{\chi}}_{1_{oc}} \\ {\overset{\Cup}{\chi}}_{2_{oc}} \\ \vdots \\ {\overset{\Cup}{\chi}}_{N_{r_{oc}}} \end{bmatrix} = {{\begin{bmatrix} \Upsilon_{1,1} & \Upsilon_{1,2} & \ldots & \Upsilon_{1,N_{r}} \\ \Upsilon_{2,1} & \Upsilon_{2,1} & \ldots & \Upsilon_{2,N_{r}} \\ \vdots & \vdots & \ddots & \vdots \\ \Upsilon_{N_{r},1} & \Upsilon_{N_{r},2} & \ldots & \Upsilon_{N_{r},N_{r}} \end{bmatrix}\left\lbrack \begin{matrix} y_{1_{oc}} \\ y_{2_{oc}} \\ \vdots \\ y_{N_{r_{oc}}} \end{matrix} \right\rbrack} + {\quad{\begin{bmatrix} \Psi_{1,1} & \Psi_{1,2} & \ldots & \Psi_{1,N_{r}} \\ \Psi_{2,1} & \Psi_{2,1} & \ldots & \Psi_{2,N_{r}} \\ \vdots & \vdots & \ddots & \vdots \\ \Psi_{N_{r},1} & \Psi_{N_{r},2} & \ldots & \Psi_{N_{r},N_{r}} \end{bmatrix}\begin{bmatrix} D_{1_{oc}} \\ D_{2_{oc}} \\ \vdots \\ D_{N_{r_{oc}}} \end{bmatrix}}}}} & (102) \end{matrix}$ where

_(i) _(oe) is equal to X_(i) _(oe) in training mode and the frequency-domain decisions on X̆_(i) _(oe) in the decision-directed mode.

_(i,j) and Ψ_(i,j) are the Alamouti-like matrices of the feedforward and diagonal matrices of the feedback coefficients of the equalizer, respectively, For example,

$\begin{matrix} {{\Upsilon_{i,j} = \begin{bmatrix} \Upsilon_{i,j}^{1} & \Upsilon_{i,j}^{2} \\ \Upsilon_{i,j}^{2*} & {- \Upsilon_{i,j}^{1*}} \end{bmatrix}},{\Psi_{i,j} = \begin{bmatrix} \Psi_{i,j}^{1} & 0 \\ 0 & {- \Psi_{i,j}^{2*}} \end{bmatrix}}} & (103) \end{matrix}$

_(i,j) ¹,

_(i,j) ², Ψ_(i,j) ¹, and Ψ_(i,j) ² are diagonal matrices, according to certain embodiments. However, a closed-form approach to computing the equalizer output may not be used in an adaptive approach. For example for the i^(th) user,

$\begin{matrix} {{\overset{\Cup}{\chi}}_{i_{oc}} = {\begin{bmatrix} {\overset{\Cup}{\chi}}_{i_{o}} \\ {\overset{\Cup}{\chi}}_{i_{c}} \end{bmatrix} = {\sum\limits_{j = 1}^{N_{r}}\left( {{\Upsilon_{i,j}y_{j_{oc}}} + {\Psi_{i,j}D_{j_{oc}}}} \right)}}} & (104) \end{matrix}$ Equation (104) can also be rewritten as

$\begin{matrix} \begin{matrix} {\begin{bmatrix} {\overset{\Cup}{\chi}}_{i_{o}} \\ {\overset{\Cup}{\chi}}_{i,c}^{*} \end{bmatrix} = {\sum\limits_{j = 1}^{N_{r}}\left( {{\begin{bmatrix} {{diag}\left( y_{j_{o}} \right)} & {{diag}\left( y_{j_{c}}^{*} \right)} \\ {- {{diag}\left( y_{j_{c}} \right)}} & {{diag}\left( y_{j_{o}}^{*} \right)} \end{bmatrix}\left\lbrack \begin{matrix} \mathcal{F}_{i,j}^{1} \\ \mathcal{F}_{i,j}^{2} \end{matrix} \right\rbrack} +} \right.}} \\ {\left. \quad{\begin{bmatrix} {{diag}\left( D_{j_{o}} \right)} & 0 \\ 0 & {{diag}\left( D_{j_{c}}^{*} \right)} \end{bmatrix}\begin{bmatrix} B_{i,j}^{1} \\ B_{i,j}^{2} \end{bmatrix}} \right)} \\ {\overset{\bigtriangleup}{=}{{\sum\limits_{j = 1}^{N_{r}}{y_{oc}^{i}\mathcal{F}_{i,j}}} + {D_{oc}^{j}B_{i,j}}}} \end{matrix} & (105) \end{matrix}$ where

_(i,j) ¹ and

_(i,j) ² are the vectors having the diagonal elements of

_(i,j) ¹ and

_(i,j) ², respectively. Likewise,

_(i,j) ¹ and

_(i,j) ² are the vectors having the diagonal elements of Ψ_(i,j) ¹ and Ψ_(i,j) ², respectively. Also,

_(i,j)=[

_(i,j) ^(1T) . . .

_(i,j) ^(2T)]^(T) and

_(i,j)=[

_(i,j) ^(1T) . . .

_(i,j) ^(2T)]^(T). For all users,

$\begin{matrix} {\begin{bmatrix} {\overset{\Cup}{\chi}}_{1_{oc}} \\ \vdots \\ {\overset{\Cup}{\chi}}_{N_{r_{oc}}} \end{bmatrix} = {\sum\limits_{j = 1}^{N_{r}}\left( {{\begin{bmatrix} y_{oc}^{j} & \; & \; \\ \; & \ddots & \; \\ \; & \; & y_{oc}^{j} \end{bmatrix}\begin{bmatrix} \mathcal{F}_{1,j} \\ \vdots \\ \mathcal{F}_{N_{r},j} \end{bmatrix}} + {\begin{bmatrix} D_{oc}^{j} & \; & \; \\ \; & \ddots & \; \\ \; & \; & D_{oc}^{j} \end{bmatrix}\begin{bmatrix} B_{1,j} \\ \vdots \\ B_{N_{r},j} \end{bmatrix}}} \right)}} & (106) \end{matrix}$ which can be rewritten in compact form as

$\begin{matrix} {{\overset{\Cup}{\chi}}_{oc} = {\sum\limits_{j = 1}^{N_{r}}\left\lbrack {{{\overset{\prime}{Z}}_{j}\mathcal{F}_{j}} + {{\overset{\prime}{D}}_{j}B_{j}}} \right\rbrack}} & (107) \end{matrix}$ At the k^(th) instant, the output from the equalizer 170 is computed as {hacek over (X)} _(oe,k)

_(k)

_(k)+

_(k)

_(k)  (108) where

_(k)=[

_(1,k) . . .

_(N) _(r) _(,k)],

_(k)=[

_(1,k) . . .

_(N) _(r) _(,k)],

_(k)=[

_(1,k) ^(T) . . .

_(N) _(r) _(,k) ^(T)]^(T) and

_(k)=[

_(1,k) ^(T) . . .

_(N) _(r) _(,k) ^(T)]^(T) In addition, the output of the equalizer 170 is fed to a M-point IDFT module 172 where the equalized symbols are transformed back to the time domain via IDFT.

The desired response can be defined as D_(k)=[D_(1,k) ^(T) . . . D_(N) _(r,k) ^(T)]^(T), and

$D_{i,k} = \left\{ \begin{matrix} {\begin{bmatrix} \chi_{i_{o},k} \\ \chi_{i_{c},k}^{*} \end{bmatrix},} & {{for}\mspace{14mu}{training}\mspace{14mu}{mode}} \\ {{\hat{\chi}}_{i_{oc},k},} & {{for}\mspace{14mu}{decision}\text{-}{directed}\mspace{14mu}{mode}} \end{matrix} \right.$ Now

_(k) can be represented by equation (58),

_(k) as in equation (46), and ε_(k) as in equation (76).

In addition, the complexity of determining the equalizer output can be reduced based on the structure of the matrices used in the equalizer output determination. For example, starting with k=0 and using P₀ ¹=ϵ⁻¹I_((N) _(r) ₎ ₂ _(M). P₁ ¹ can be represented by P ₁ ¹=λ⁻¹[ϵ⁻¹ I _((N) _(r) ₎ ₂ _(M)−λ⁻¹ϵ⁻¹ I _((N) _(r) ₎ ₂ _(M)

₁ ^(H)(I _(N) _(r) _(M)+λ⁻¹ϵ⁻¹

₁

₁ ^(H))⁻¹

₁ϵ⁻¹ I _((N) _(r) ₎ ₂ _(M)]  (109)

In addition,

₁

₁ ^(H)=[|

_(1,1)|²+ . . . +|

_(N) _(r) _(,1)|²], where

_(i,k)

_(i,k) ^(H)=diag[|

_(oe) ^(i)|², . . . , |

oe^(i)|²]. Since |

_(k)|² has a diagonal structure, computing equation (109) does not include performing matrix inversion, which can be a costly computational operation. In addition,

₁ ^(H)(I_(N) _(r) _(M)+λ⁻¹ϵ⁻¹

₁

₁ ^(H))⁻¹

₁=Ø is a (N_(r) ²M×N_(r) ²M) matrix, where each entry Ø_(i,j)=

_(i,1) ^(H)(I_(N) _(r) _(M)+λ⁻¹ϵ⁻¹

₁

₁ ^(H))⁻¹

₁ is a N_(r)M×N_(r)M block diagonal matrix. Each sub-matrix in Ø_(i,j) is a M×M block matrix, which further includes four

$\frac{M}{2} \times \frac{M}{2}$ diagonal matrices. Therefore, the structure of

₁ ¹ corresponds to the structure of Ø where each sub-matrix

_(1,(i,j)) ¹=λ⁻¹[ϵ⁻¹I_(N) _(r) _(M)−λ⁻¹ϵ⁻²I_(N) _(r) _(M)Ø_(i,j)] is a N_(r)M× N_(r)M block diagonal matrix.

$\begin{matrix} {{{{For}\mspace{14mu} k} = 1},{P_{2}^{1} = {\lambda^{- 1}\left\lbrack {P_{1}^{1} - {\lambda^{- 1}P_{1}^{1}{Z_{2}^{H}\left( {I_{N_{r}M} + {\lambda^{- 1}Z_{2}P_{1}^{1}Z_{2}^{H}}} \right)}^{- 1}Z_{2}P_{1}^{1}}} \right\rbrack}}} & (110) \\ {{{In}\mspace{14mu}{addition}},{\left( {I_{N_{r}M} + {\lambda^{- 1}Z_{2}P_{1}^{1}Z_{2}^{H}}} \right)^{- 1} = \begin{bmatrix} A_{1}^{- 1} & \; & \; \\ \; & \ddots & \; \\ \; & \; & A_{N_{r}}^{- 1} \end{bmatrix}}} & (111) \end{matrix}$ where

${A_{i} = \begin{bmatrix} A_{i,1} & A_{i,2} \\ A_{i,3} & A_{i,4} \end{bmatrix}},$ and each entry A_(i,j) is a diagonal matrix. Therefore, the inverse of A_(i) can be determined using block matrix inversion described in T. Kailath, Linear Systems. Englewood Cliffs, N.J.: Prentice Hall, 1980, where all of the sub-matrices are diagonal, therefore, the inversions can be determined via scalar inversion, which allows for a reduction in computational complexity. For k>1, the structure of

_(k) ¹ corresponds to the structure of

₁ ¹. In addition, the computation of

_(k) ² follows a process that corresponds to the computation of

_(k) ¹ described herein for the spatially-multiplexed (SM) case.

FIG. 8 is a block diagram representation of a receiver, according to certain embodiments. The receiver 104 may include an antenna 820, reception circuitry 802, the equalizer 170, a demodulator 804, a receiver data processor 806, a processor 808, a memory 810, and a bus 818. The receiver 104 may be part of the base station 20. At the base station 250, the reception circuitry 802 may receive an RF signal from the antenna 820. The reception circuitry 802 may also receive other types of signals such as microwave, optical, or the like. The base station 250 may include a plurality of antennas. The reception circuitry 802 provides the RF signal to the equalizer 170. The equalizer 170 may use any of the equalization techniques described previously to provide the output samples to the demodulator 804. The demodulator 804 process the output samples and provides symbol estimates to the receiver data processor 806. The receiver data processor 806 process the symbol estimates and provides decoded data. The memory 810 may store data and program code for the receiver 104. The memory 810 may store past data performance, noise level, number of particles used, and filter coefficients. The equalizer 170, the demodulator 804, the receiver data processor 806, the processor 808, and the memory 810 are connected to each other by the bus 818. The memory 810 may include, for example, a magnetic storage device such as a Hard Disk Drive (HDD), a semiconductor storage device, an optical storage device, or a magneto-optical storage device. The memory 810 may be utilized as working memory by the processor 808 while executing process and algorithms of the present disclosure. The antenna 820 receives electromagnetic wave signals between base stations, mobile devices, for performing radio-based communication, such as the various forms of cellular telephone communication. The processor 808 may be a Xenon or Core processor from Intel of America or an Opteron processor from AMD of America, or may be other processor types that would be recognized by one of ordinary skill in the art. Alternatively, the processor 808, the receiver data processor 806, and the equalizer 170 may be implemented on an FPGA, ASIC, PLD or using discrete logic circuits, as one of ordinary skill in the art would recognize.

The hardware description above, exemplified by any one of the structure examples shown in FIG. 8 constitutes or includes specialized corresponding structure that is programmed or configured to perform the adaptation algorithm 300 shown in FIG. 3.

The system of the present disclosure may be further programmed to execute any algorithm and method disclosed in N. Iqbal, N. Al-Dhabir, and A. Zerguine, “Adaptive Frequency-Domain Decision Feedback Equalization using constraint-based RLS for Uplink LTE” and N. Iqbal, N. Al-Dhabir, and A. Zidouri, “Adaptive Frequency-Domain RLS DFE for Uplink MIMO SC-FDMA,” the entire disclosures of which are incorporated herein by reference.

According to certain embodiments, Minimum Mean Square Error (MMSE), transient, steady-state, tracking and computational complexity analyses can be performed for the implementations of the SC-FDMA described herein. In the derivations described further herein, the data sequences (both transmitted data and detected data) can be assumed to be independent and identically distributed (i.i.d) with zero mean, and independent of the noise. The optimal MMSE equalizer weights Wo for i^(th) frequency bin are given as described in N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, the entire contents of which is incorporated by reference in its entirety and C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, the entire contents of which are incorporated by reference in its entirety, for LE,

${F^{o}(i)} = \frac{\sigma_{x}^{2}\Lambda\;(i)^{*}}{{\sigma_{x}^{2}{{\Lambda(i)}}^{2}} + {\frac{N}{M}\sigma_{N}^{2}}}$ and for DFE

$\left. {{{F^{o}(i)} = \frac{\sigma_{x}^{2}\Lambda\;(i)^{*}}{{\sigma_{x}^{2}{\sum\limits_{j = 0}^{M - 1}{{\Lambda(j)}}^{2}}} + {\frac{N}{M}\sigma_{N}^{2}}}},{{B^{o}(i)} = {{{- {F^{o}(i)}}{\Lambda(i)}} + {\frac{1}{M}{\sum\limits_{j = 0}^{M - 1}{{F^{o}(j)}{\Lambda(j)}}}}}}} \right)$ Using these weights, the corresponding i^(th) frequency bin MMSE for the LE (J_(L)(i)) and the DFE (J_(D)(i)) can be derived, respectively, and expressed as

$\begin{matrix} {{J_{L}(i)} = \frac{N\;\sigma_{N}^{2}{\sigma_{x}^{2}\left( {{M{{\Lambda(i)}}^{2}\sigma_{x}^{2}} + {N\;\sigma_{N}^{2}}} \right)}}{\left( {{\sigma_{x}^{2}{{\Lambda(i)}}^{2}M} + {N\;\sigma_{N}^{2}}} \right)^{2}}} & (112) \end{matrix}$ and

$\begin{matrix} {{\xi_{k + 1}(i)} = {{\xi_{k + 1}^{o}(i)} + {{a_{k + 1}(i)}\underset{\underset{{\overset{\sim}{W}}_{k}{(i)}}{︸}}{\left\lbrack {{W^{o}(i)} - {W_{k}(i)}} \right\rbrack}}}} & (113) \end{matrix}$

For transient analysis, the transient behavior of the RLS AFD-DFE can be derived in a stationary environment by assuming that the forgetting factor λ is unity. The MSE of the RLS AFD-DFE at instant k+1 is represented by J _(k+1) ⁽ i)=E|ξ _(k+1)(i)|²  (114) The a priori estimation output error ξ_(k+1)(i) can be expressed as

$\begin{matrix} {{J_{D}(i)} = \frac{N\;\sigma_{N}^{2}{\sigma_{x}^{2}\left( {{M{{A(i)}}^{2}\sigma_{x}^{2}} + {N\;\sigma_{N}^{2}}} \right)}}{\left( {{\sigma_{x}^{2}{\sum\limits_{j = 0}^{M - 1}{{\Lambda(j)}}^{2}}} + {N\;\sigma_{N}^{2}}} \right)^{2}}} & (115) \end{matrix}$ where X_(k)(t)=a_(k)(i)

^(o)(i)+ξ_(k) ^(o)(i), a_(k)(i)=[

_(k)(i) D_(k)(i)] and

_(k)(i)=[

_(k)(i) B_(k)(i)]^(T)·g equation (115) into equation (114) and expanding terms results in

$\begin{matrix} {{J_{k + 1}^{r}(i)} = {\underset{\underset{J_{D}{(i)}}{︸}}{E{{\xi_{k + 1}^{o}(i)}}^{2}} + {{{Ea}_{k + 1}(i)}{{\overset{\sim}{W}}_{k}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}{a_{k + 1}^{H}(i)}} + {{{Ea}_{k + 1}(i)}{{\overset{\sim}{W}}_{k}(i)}{\xi_{k + 1}^{o*}(i)}} + {E\;{\xi_{k + 1}^{o}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}{a_{k + 1}^{H}(i)}}}} & (116) \end{matrix}$ The second expectation in equation (116) can be expressed as

$\begin{matrix} \begin{matrix} {{E\;{a_{k + 1}(i)}{{\overset{\sim}{W}}_{k}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}{a_{k + 1}^{H}(i)}} = {E\;{Tr}\left\{ {{a_{k + 1}(i)}{{\overset{\sim}{W}}_{k}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}{a_{k + 1}^{H}(i)}} \right\}}} \\ {= {E\;{Tr}\left\{ {{{\overset{\sim}{W}}_{k}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}{a_{k + 1}^{H}(i)}{a_{k + 1}(i)}} \right\}}} \\ {= {{TrE}\left\{ {{{\overset{\sim}{W}}_{k}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}{a_{k + 1}^{H}(i)}{a_{k + 1}(i)}} \right\}}} \end{matrix} & (117) \end{matrix}$ where Tr{⋅} is the trace of a matrix. Using an assumption that the product

_(k)(i)

_(k) ^(H)(i) varies at a rate that is slower than the product a_(k+1) ^(H)(i)a_(k+1)(i), E

_(k)(i)

_(k) ^(H)(i)a _(k+1) ^(H)(i)a _(k+1)(i)≈Tr{E

_(k)(i)

_(k) ^(H)(i)Ea _(k+1) ^(H)(i)a _(k+1)(i)}  (118) where Ea_(k+1) ^(H)(i)a_(k+1)(i)=R

(i) is an input correlation matrix having diagonal structure, i.e.,

${R_{A}(i)} = {{{diag}\left( \left\lbrack {{{{\Lambda(i)}}^{2}\sigma_{x}^{2}} + {\frac{N}{M}\sigma_{N}^{2}\mspace{14mu}\sigma_{x}^{2}}} \right\rbrack \right)}.}$ To determine E

_(k)(i)

_(k) ^(H)(i),

_(k)(i)=Θ_(k) ⁻¹(i)R _(k)(i)  (119) where

${R_{k}(i)} = {\sum\limits_{j = 0}^{k}{{a_{k}^{H}(i)}{{X_{j}(i)}.}}}$ Using the values of Θ_(j)(i), R_(k)(i), and X_(k)(i) in equation (119) and ignoring the initial conditions,

_(k)(i) can be represented by

$\begin{matrix} {{{\overset{\sim}{W}}_{k}(i)} = {{{- {\Theta_{k}^{- 1}(i)}}{\sum\limits_{j = 0}^{k + 1}{{a_{j}^{H}(i)}\;{\xi_{j}^{o}(i)}}}} + {{\theta_{k}^{- 1}(i)}\;{\alpha_{k + 1}\left\lbrack {0\mspace{14mu} 1} \right\rbrack}^{T}}}} & (120) \end{matrix}$ Now,

$\begin{matrix} {{{{\overset{\sim}{W}}_{k}(i)}} \leq {\underset{\underset{{\overset{\sim}{W}}_{k}^{\prime}{(i)}}{︸}}{{- {\Theta_{k}^{- 1}(i)}}{\sum\limits_{j = 0}^{k + 1}{{a_{j}^{H}(i)}\;{\xi_{j}^{o}(i)}}}}}} & (121) \end{matrix}$ Therefore, the weight-error correlation matrix of

_(k)′(i) is given as

$\begin{matrix} {{{{\overset{\sim}{EW}}_{k}^{\prime}(i)}{{\overset{\sim}{W}}_{k}^{\prime H}(i)}} = {E\;{\Theta_{k}^{- 1}(i)}{\sum\limits_{j = 0}^{k + 1}{{a_{j}^{H}(i)}{a_{j}(i)}{\Theta_{k}^{- 1}(i)}{\xi_{j}^{o}(i)}{\xi_{j}^{o*}(i)}}}}} & (122) \end{matrix}$ Using an assumption that a_(k)(i), and therefore Θ_(k) ⁻¹(i), is independent of the noise ξ_(k) ^(o)(i), equation (122) can be expressed as a product of two expectations a follows:

$\begin{matrix} \begin{matrix} {{E{{\overset{\sim}{W}}_{k}^{\prime}(i)}{{\overset{\sim}{W}}_{k}^{\prime H}(i)}} = {E\;{\Theta_{k}^{- 1}(i)}{\sum\limits_{j = 0}^{k + 1}{{a_{j}^{H}(i)}{a_{j}(i)}{\Theta_{k}^{- 1}(i)}E\;{\xi_{j}^{o}(i)}{\xi_{j}^{o*}(i)}}}}} \\ {= {{J_{D}(i)}E\;{\Theta_{k}^{- 1}(i)}}} \end{matrix} & (123) \end{matrix}$ Assuming ergodicity, the ensemble-average correlation matrix of the input of the AFD-DFE can be expressed as

${R_{A}(i)} = {\frac{1}{k}{{\Theta_{k}(i)}.}}$ Therefore, the weight-error correlation matrix can be reduced to

${E{{\overset{\sim}{W}}_{k}(i)}{{\overset{\sim}{W}}_{k}^{H}(i)}} = {\frac{1}{k}{J_{D}(i)}{{R_{A}^{- 1}(i)}.}}$ Therefore,

$\begin{matrix} {{{{{Ea}_{k + 1}(i)}{{\overset{\sim}{W}}_{k}^{\prime}(i)}{{\overset{\sim}{W}}_{k}^{\prime\; H}(i)}{a_{h + 1}^{H}(i)}} \approx {\frac{1}{k}{J_{D}(i)}{Tr}\left\{ {{R_{A}(i)}{R_{A}^{- 1}(i)}} \right\}}} = {\frac{2}{k}{J_{D}(i)}}} & (124) \end{matrix}$ The third and fourth expectations in equation (116) are zero because

_(k)(i) depends on past values of a_(k+1)(i) and ξ_(k+1) ^(o)(i). Also, a_(k+1)(i) and ξ_(k+1) ^(o)(i) are statistically independent, and ξ_(k+1) ^(o)(i) has zero mean. Therefore,

$\begin{matrix} {{J_{k + 1}^{\prime\; r}(i)} = {{{J_{D}(i)}\left\lbrack {1 + \frac{2}{k}} \right\rbrack} \geq {J_{k + 1}^{r}(i)}}} & (125) \end{matrix}$ To compare the RLS AFD-DFE (both constrained and unconstrained) and LMS AFD-DFE, the MSE of the LMS AFE-DFE can be shown based on S. Haykin, Adaptive Filter Theory. Prentice Hall, Upper-Saddle River, N.J., 4th ed., 2002, the entire contents of which is herein incorporated by reference, to be

$\begin{matrix} {{J_{k + 1}^{\prime\; t}(i)} = {{{J_{D}(i)}\left\lbrack {1 + {\mu{\sum\limits_{j = 0}^{1}\frac{r_{j}}{2 - {\mu\; t_{j}}}}}} \right\rbrack} + {\sum\limits_{j = 0}^{1}{{r_{j}\left( {{{v_{j}(0)}}^{2} - \frac{\mu\;{J_{D}(i)}}{2 - {\mu\; r_{j}}}} \right)}\left( {1 - {\mu\; r_{j}}} \right)^{2\; k}}}}} & (126) \end{matrix}$ where r_(j) is the j^(th) eigenvalue of the correlation matrix R

and υ(k+1)=[υ₀(k+1), [υ₁(k+1)]^(T)=(I₂−μ

_(k+1)(i)^(H)ε_(k+1)(i). The evolution of J_(k+1)′^(l)(i) with step size μ is governed by exponential quantity (1−μr_(j))^(2k). Therefore, the RLS AFD-DFE (for both constrained and unconstrained cases) converges faster than the LMS AFD-DFE. The simulation results discussed further herein illustrate that the RLS AFD-DFE converges more quickly than the LMS AFD-DFE.

For steady-state analysis, the update constrained recursion of equation (52) for the i^(th) frequency bin can be written as

_(k+1)(i)=

_(k)(i)+

_(k+1)(i)(a _(k+1) ^(H)(i)ξ_(k+1)(i)−α_(k+1) G ^(T))  (127) Likewise, the update unconstrained recursion of equation (53) for the i^(th) frequency bin can be written as

_(k+1)(i)=

_(k)(i)+

_(k+1)(i)a _(k+1) ^(H)(i)ε_(k+1)(i)  (128) In terms of the weight error vector

_(k)′(i),

_(k+1)′(i)=

_(k)′(i)−

_(k+1)(i)a _(k+1) ^(H)(i)ε_(k+1)(i)  (129) Multiplying equation (129) by a_(k+1)(i) from the left, the weight error vector can be written in terms of a priori estimation error ξ_(k+1) ^(a)(i) and a posteriori estimation error ξ_(k+1)

(i) as follows:

$\begin{matrix} {\underset{\underset{\xi_{k + 1}^{\varphi}{(i)}}{︸}}{{a_{k + 1}(i)}{{\overset{\sim}{W}}_{k + 1}^{\prime}(i)}} = {\underset{\underset{\xi_{k + 1}^{a}{(i)}}{︸}}{{a_{k + 1}(i)}{{\overset{\sim}{W}}_{k}^{\prime}(i)}} - {\underset{\underset{{{a_{k + 1}{(i)}}}_{P}}{︸}}{{a_{k + 1}(i)}{P_{k + 1}(i)}{a_{k + 1}^{H}(i)}}{\xi_{k + 1}(i)}}}} & (130) \end{matrix}$ where |⋅∥_(P) stands for the squared-weighted Euclidean norm of a vector. Combining equations (129) and (130) to eliminate ξ_(k+1),

$\begin{matrix} {{{{\overset{\sim}{W}}_{k + 1}^{\prime}(i)} + {{P_{k + 1}(i)}{a_{k + 1}^{H}(i)}\underset{\underset{{\overset{\_}{a}}_{k + 1}{(i)}}{︸}}{\left( {{a_{k + 1}(i)}}_{P} \right)^{\dagger}}{\xi_{k + 1}^{a}(i)}}} = {{{\overset{\sim}{W}}_{k}^{\prime}(i)} + {{P_{k + 1}(i)}{\alpha_{k + 1}^{H}(i)}\left( {{a_{k + 1}(i)}}_{P} \right)^{\dagger}{\xi_{k + 1}^{p}(i)}}}} & (131) \end{matrix}$ where (⋅)^(†) represents the pseudo-inverse. By equaling the energies (squared Euclidean norms) of both sides of equation (131) with [

_(k+1)(i)]⁻¹ as a weighting matrix, the energy conservation relation becomes ∥

_(k+1)′(i)∥

⁻¹ ² +ā _(k+1)(i)|ξ_(k+1) ^(a)(i)|²=∥

_(k)′(i)∥

⁻¹ ² +ā _(k+1)(i)|ξ_(k+1)

(i)|²  (132) Taking the expectation of equation (132) and using the steady-state approximations,

$\begin{matrix} {{{{{EP}_{k + 1}(i)} \approx {\left( {1 - \lambda} \right)R_{A}^{- 1}}} = P},{{E{{\overset{\sim}{W}}_{k + 1}^{\prime}(i)}} = {{E{{\overset{\sim}{W}}_{k}^{\prime}(i)}\mspace{14mu}{and}\mspace{14mu} E{{{\overset{\sim}{W}}_{k + 1}^{\prime}(i)}}_{P^{- 1}}^{2}} = {E{{{\overset{\sim}{W}}_{k}^{\prime}(i)}}_{P^{- 1}}^{2}}}},\mspace{20mu}{{E{{\overset{\_}{a}}_{k + 1}(i)}{{\xi_{k + 1}^{a}(i)}}^{2}} = {E{{\overset{\_}{a}}_{k + 1}(i)}{{\xi_{k + 1}^{p}(i)}}^{2}}},\left. k\rightarrow\infty \right.} & (133) \end{matrix}$ Substituting ξ_(k+1)

^(p)(i) from equation (130) into equation (133) results in Eā _(k+1)(i)|ξ_(k+1) ^(a)(i)|² =Eā _(k+1)(i)|ξ_(k+1) ^(a)(i)−∥a _(k+1)(i)∥

ξ_(k+1)(i)|² , k→∞  (134) which upon expansion and simplification reduces to E∥a _(k+1)(i)∥

|ξ_(k+1)(i)|²=2Re(Eξ _(k+1) ^(a)*(i)ξ_(k+1)(i)), k→∞  (135) As ξ_(k+1)(i)=ξ_(k+1) ^(o)(i)+ξ_(k+1) ^(a)(i), equation (135) becomes

$\begin{matrix} {{{{{J_{D}(i)}E{{a_{k + 1}(i)}}_{P}} + {E{{a_{k + 1}(i)}}_{P}{{\xi_{k + 1}^{a}(i)}}^{2}}} = {2\;\underset{\underset{J_{exss}{(i)}}{︸}}{E{{\xi_{k + 1}^{a}(i)}}^{2}}}},\left. k\rightarrow\infty \right.} & (136) \end{matrix}$ where J′_(exss)(i) is the excess mean square error (EMSE). It can be assumed that at steady-state, ∥a_(k+1)(i)∥

is independent of ξ_(k+1) ^(a)(i), which allows separation of expectation E∥a_(k+1)(i)∥

into a product of two expectations as follows: E∥a _(k+1)(i)∥

|ξ_(k+1) ^(a)(i)|² =E∥a _(k+1)(i)∥

E|ξ _(k+1) ^(a)(i)|₂  (137) If

_(k+1)(i) is replaced by the assumed mean value, the following approximation is obtained: E∥a _(k+1)(i)∥

≈Tr{

}=2(1−λ)  (138) Substituting equation (138) into equation (136) results in

$\begin{matrix} {{J_{exss}^{\prime}(i)} = \frac{{J_{D}(i)}\left( {1 - \lambda} \right)}{\lambda}} & (139) \end{matrix}$ Therefore, the MSE at the steady-state J_(ss)(i) is given as

$\begin{matrix} {{J_{ss}^{\prime}(i)} = {{{J_{D}(i)} + {J_{exss}^{\prime}(i)}} = \frac{J_{D}(i)}{\lambda}}} & (140) \end{matrix}$

Another cause of ICI is the Doppler Effect, which can occur due to motion of a user of an external device, such as a mobile device. This motion causes the channel h^((m)) to be time variant with each tap h_(i) ^((m)), i=1; . . . ; L(m) being modeled as a Wide Sense Stationary (WSS) narrow-band complex Gaussian process, independent of the other taps. Hence, the amplitude |h_(i) ^((m))| is assumed to be Rayleigh distributed, i.e., P(|h _(i) ^((m))|)=−h _(i) ^((m)) |e ^(−|h) ^(i) ^((m)) ^(|) ² , |h _(i) ^((m))|≥0  (141) and the phase of the motion is is uniformly distributed, i.e.,

$\begin{matrix} {{{P\left( {\angle\; h_{i}^{(m)}} \right)} = \frac{1}{2\;\pi}},{{- \lambda} \leq {\angle\; h_{i}^{(m)}} \leq \pi}} & (142) \end{matrix}$ The autocorrelation function of the h_(i) ^((m)) can be modeled as a zeroth-order Bessel function of the first kind, i.e.,

(n)

_(o)(2πf _(d) t _(s) n), n= . . . ,−1,0,1 . . .  (143) where f_(d) is the maximum Doppler frequency, t_(s) is the sampling time, and

_(o) can be defined as

$\begin{matrix} {{J_{o}(z)}\overset{\bigtriangleup}{=}{\frac{1}{\pi}{\int_{0}^{\pi}{{\cos\left( {z\;\sin\;\theta} \right)}d\;\theta}}}} & (144) \end{matrix}$ The Doppler frequency f_(d) is related to the user speed υ and carrier frequency f_(c) as f_(d)=υf_(c)/c, where c is the speed of light. The Rayleigh fading channel can be generated using Jake's model.

For time-varying channels, a first-order random walk model can be adopted for the variation in the tap weight vector

_(k) ^(o). The model assumes that

_(k) ^(o) undergoes random variations of the form

_(k+1) ^(o)(i)=

_(k) ^(o)(i)+q _(k+1)(i)  (145) where q_(k)(i)=[f_(L)(i)q_(k) ^(F)(i)f_(L)(i)q_(k) ^(B)(i)]^(T) and f_(L)(i) is the i^(th) row of partial (M×L) DFT matrix. q_(k) ^(F)(i) and q_(k) ^(B)(i) are the time-domain random column vectors of length L with zero mean and correlation matrix Q_(t)=(1−

²(1))I_(L) as described in A. H. Sayed, Fundamentals of Adaptive Filtering. New York: Wiley, 2003. Assuming that q_(k) ^(F)(i) and q_(k) ^(B)(i) are independent and noting that f_(L)(i)f_(L)(i)^(H)=L/M, the covariance matrix of q_(k)(i), i=1, . . . , M, is

$Q = {\frac{L}{M}\left( {1 - {R^{2}(1)}} \right){I_{2\; L}.}}$

Now, defining

_(k) ^(o)(i)−

′_(k)(i)=

_(k)′(i), the energy conservation relation of equation (132) becomes

_(k+1) ^(o)(i)−

′_(k−1)(i)

⁻¹ ² +ā _(k+1)(i)ξ_(k+1) ^(a)(i)²=

_(k+1) ^(o)(i)−

′_(k)(i)∥

⁻¹ ² +ā _(k+1)(i)ξ_(k+1)

(i)|²  (146) where ξ_(k+1)

(i)=a_(k+1)(i)[

_(k+1) ^(o)(i)−

′_(k+1)(i)] and ξ_(k+1) ^(a)(i)=a_(k+1)(i)[

_(k+1) ^(o)(i)−

′_(k)(i)]. Moreover, the random walk model of equation (146) allows E∥

_(k+1) ^(o)(i)−

′_(k)(i)∥

⁻¹ ² to be related to E∥

′_(k)(i)∥

⁻¹ ² as follows:

$\begin{matrix} \begin{matrix} {{E{{{W_{k + 1}^{o}(i)} - {W_{k}^{\prime}(i)}}}_{p - 1}^{2}} = {E{{{W_{k}^{o}(i)} + {q_{k + 1}(i)} - {W_{k}^{\prime}(i)}}}_{P^{- 1}}^{2}}} \\ {= {E{{{{\overset{\sim}{W}}_{k}^{\prime}(i)} + {q_{k + 1}(i)}}}_{P^{- 1}}^{2}}} \\ {= {{E{{{\overset{\sim}{W}}_{k}^{\prime}(i)}}_{P^{- 1}}^{2}} + {{q_{k + 1}(i)}}_{P^{- 1}}^{2}}} \end{matrix} & (147) \end{matrix}$ where the last step of equation (147) follows from the fact that

′_(k)(i) is independent of q_(k+1)(i) and uses steady-state assumptions. Next taking the expectation of equation (146) results in E∥

′ _(k+1)(i)∥

⁻¹ ² +Eā _(k+1)(i)|ξ_(k+1) ^(a)(i)|² =E∥

′ _(k)(i)∥

⁻¹ ² +E∥q _(k+1)(i)∥

⁻¹ ² +Eā _(k+1)(i)|ξ_(k+1)

(i)|²  (148) Moreover, q_(k+1)(i) is independent of

_(k+1)(i), so that

$\begin{matrix} \begin{matrix} {{J_{nn}^{\prime}(i)} = {{J_{D}(i)} + \frac{{2\left( {1 - \lambda} \right){J_{DFE}(i)}} + {\frac{1}{\left( {1 - \lambda} \right)}{Tr}\left\{ {QR} \right\}}}{2 - {2\left( {1 - \lambda} \right)}}}} \\ {= {\frac{J_{D}(i)}{\lambda} + \frac{{Tr}\left\{ {QR} \right\}}{2\;{\lambda\left( {1 - \lambda} \right)}}}} \end{matrix} & (150) \end{matrix}$ Solving equation (148) as solved in the steady-state analysis described previously and using equation (149), in the a time-varying environment, the MSE of the RLS AFD-DFE can be represented as

$\begin{matrix} {{E{{q_{k + 1}(i)}}_{P - 1}^{2}} = {{{Tr}\; E\left\{ {{q_{k + 1}(i)}^{H}P^{- 1}{q_{k + 1}(i)}} \right\}} = {{{Tr}\left\{ {QP}^{- 1} \right\}} = {\frac{1}{\left( {1 - \lambda} \right)}{Tr}\left\{ {QR}_{A} \right\}}}}} & (149) \end{matrix}$

According to certain embodiments, the computational complexity of the AFD-DFE for SISO SC-FDMA systems can be compared to the MMSE DFE described in C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, with perfect channel knowledge. The computational complexity can be determined in terms of a total number of real multiplication operations performed while computing the feedforward and feedback filter coefficients per block (one SC-FDMA block) for complex-valued data.

For example, for the matrices for

_(k+1) ¹ and

_(k+1) ² described by equations (55) and (56), respectively, 13M real multiplications are performed for

_(k+1) ¹ and

_(k+1) ², therefore 26M real multiplications are performed while computing

_(k+1). In addition, 16M real multiplications are performed when computing

_(k+1)(

_(k+1) ^(H)+ε_(k+1)−α_(k+1)G^(T)) in equation (52) and

_(k+1)

_(k+1) ^(H) ε_(k+1) in equation (53). Also, 8M multiplications are performed when computing the error term ε_(k+1) in equation (47). Therefore, 50M multiplications are performed by the processing circuitry when computing the output of the AFD-DFE. For the MMSE DFE described in C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, 20(M+1)N_(l)+2N_(l)+12M real multiplications are performed, where N_(l) denotes the number of iterations for each block, and N_(l)>1 (the N_(l) iterations are performed to solve the causality problem presented by equation (8), which is described in N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005). Compared with the AFD-LE, the number of computations performed by the AFD-DFE is doubled due to the feedback filter. In addition, the AFD-LE does not perform the iterative procedure that is performed by the AFD-DFE.

The AFD-DFE is also computationally efficient for MIMO SC-FDMA systems as compared to the MIMO MMSE DFE described by B. Dhivagar, K. Kuchi, and K. Giridhar, “An iterative MIMO-DFE receiver with MLD for uplink SC-FDMA,” National Conference on Communications (NCC), pp. 1-4, February 2013, the entire contents of which is incorporated by reference in its entirety, that performs matrix inversions. The number of real multiplications performed for the MIMO AFD-DFE described herein (assuming equal numbers of transmit and receive antennas) are described as follows. For example, for the matrices for

_(k+1) ¹ and

_(k+1) ², 9N_(t) ³M+5N_(t) ²M+N_(t)M real multiplications are performed, therefore, 18N_(t) ³M+10N_(t) ²M+2N_(t)M multiplications are performed when computing

_(k+1). Also, 8N_(t) ²M multiplications are performed when computing the error term ε_(k+1). An additional 8N_(t) ³M+8N_(t) ²M real multiplications are also performed when calculating the equalizer weights

_(k+1).

When computing the output for the MIMO AFD-DFE, 26N_(t) ³M+26N_(t) ²M+2N_(t)M real multiplications are performed. In the case of the MIMO MMSE DFE, the number of multiplications performed for the matrix inversions can be estimated based on an approach described in G. H. Golub and C. F. Van Loan, Matrix Computations. Johns Hopkins University Press, 4th ed., 2013, the entire contents of which is herein incorporated by reference. Table III summarizes the real multiplications of the AFD-DFE and the MMSE DFE for complex valued data.

FIG. 9 is an exemplary graph that shows that the computational complexity of the AFD-DFE is less than that of the MMSE DFE for N_(l)=4 (N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, also describes that N_(l)>4 does not increase performance of the MMSE DFE). In addition, the AFD-DFE described herein does not depend on channel estimation, which can also be referred to as channel state information (CSI). In the computational complexity determinations shown in Table III and FIG. 9, a known channel is assumed. Because the MMSE DFE depends on channel estimation, the overall complexity will be further increased. Also, pilots may be needed for the MMSE DFE to estimate the channel, which may also increase overhead.

TABLE III Computational complexity of the AFD-DFE and MMSE DFE Structure Real Multiplications SISO AFD-DFE 50M MMSE DFE [16] 20(M + 1)N_(l) + 2N_(l) + 12M (with known channel) MIMO AFD-DFE 26N_(t) ³M + 26N_(t) ² M + 2N_(t)M MMSE DFE 8MN_(t) ³ − (20N_(l))/3 − 6N_(l)N_(t) ² + (62N_(l)N_(t) ³)/3 + [27] (4N_(l)N_(t) ⁴)/3 + 2N_(t) ² + 2N_(t) ³ + (20N_(l)N_(t))/3 + (with known (20MN_(l)N_(t))/3 − 6MN_(l)N_(t) ² + (86MN_(l)N_(t) ³)/3 + channel) (4MN_(l)N_(t) ⁴)/3

Next, the theoretical findings for the constrained AFD-DFE described herein can be validated through simulation. Like LTE systems, the carrier frequency and bandwidth can be set to 2 GHz and 5 MHz, respectively. Other simulation parameters are M=64 and N=1024, therefore, a maximum number of users that the system can support may be K=16. A Quadrature Phase shift Keying (QPSK) modulation scheme may be used, and the channel is frequency selective with 12-paths with each path fading independently, according to the Rayleigh distribution.

FIG. 10 is an exemplary graph depicting performance of the AFD-DFE algorithms described herein, according to certain embodiments. As can be seen in FIG. 10, best performance can be obtained through the use of the AFD-DFE with constraint for Case 2 of the AFD-DFE, while the worst performance is obtained by the AFD-DFE with constraint for Case 1 of the AFD-DFE. In some implementations, case 2 performs best because the input correlation matrix for the feedforward filter is unaffected by the decision errors. In addition, FIG. 10 also illustrates that in Case 1, when the AFD-DFE is switched to decision-directed mode at approximately 100 iterations, the MSE increases to be higher than the MSE for the other illustrated methods. In addition, the computational complexity of the AFD-DFE (Case 1) is higher than that of the AFD-LE but leads to an increased performance. Also, FIG. 10 shows that the performance of the RLS-based AFD-DFE performs better than that of the LMS-based AFD-DFE in terms of convergence speed and MSE. The RLS AFD-DFE converges after fifteen iterations, which can reduce an impact of complexity and latency increase due to adaptation. FIG. 10 also depicts the theoretical curves and close agreement between the theoretical findings and simulations. For example, the theoretical curve of AFD-DFE assumes perfect decisions. For FIGS. 11-18, the simulation results for the Case 2 AFD-DFE are used.

FIG. 11 is an exemplary graph of CRLS and RLS performance, according to certain embodiments. Since in CRLS, b₀ is closer to zero than in RLS helps ensures that the present symbol is not being canceled out.

FIG. 12 is an exemplary graph of Bit Error Rate (BER) performance of the AFD-DFE, according to certain embodiments. As illustrated in FIG. 12, the AFD-DFE with CRLS outperforms the RLS-based AFD-DFE, practical MMSE DFE and MMSE LE with known channel in terms of BER. In addition, minimal error propagation in the AFD-DFE occurs, which is unlike the practical MMSE DFE of C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010 and N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, with a known channel, which has a higher error propagation due to poor estimation of the correlation between the transmitted data and the decisions. Moreover, as can be seen from FIG. 12, the performance of the constraint-based RLS AFD-DFE corresponds to to that of an ideal MMSE DFE as described in C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, with a known channel.

In addition, the effect of the Doppler on the SC-FDMA system is illustrated in FIGS. 13 and 14. For this scenario, speed of the user v is varied from 3 km/h to 300 km/h. As can be observed from these two figures, our proposed RLS AFD-DFE is robust to the Doppler Effect. FIG. 14 depicts the theoretical and simulated MSE for different user's velocities, which can apply to both the constrained and unconstrained RLS AFD-DFE.

FIG. 15 is an exemplary graph of SC-FDMA system performance under effect of CFO, according to certain embodiments. For example, if a slip in the frequency occurs, meaning that the transmitter and receiver are no longer synchronized, then the performance of the AFD-DFE can degrade. FIG. 15 shows that the performance of a 3-tap AFD-DFE is better than that of a 1-tap AFD-DFE in large CFO, which can illustrate that the 3-tap AFD-DFE may be more robust to the ICI.

For SFBC SC-FDMA system, independent 12-path Rayleigh fading channels can be used for each transmit/receive antenna pair. FIG. 16 is an exemplary graph of a comparison between E-SFBC and C-SFBC. For example, FIG. 16 shows that performance of E-SFBC with reordering is better than without reordering. Also, E-SFBC outperforms C-SFBC at higher SNRs.

FIG. 17 is an exemplary graph of E-SFBC performance under CFO, according to certain embodiments. With CFO, the 3-tap AFD-DFE has a better performance than 1-tap AFD-DFE. FIG. 18 is an exemplary graph showing the effect of a number of training symbols on MSE for the AFD-DFE. As can be seen from FIG. 18, if 25 percent of symbols in one SC-FDMA block are used for training and rest are used for data during the training phase, then the reduction in MSE may be insignificant. In this way we can reduce the overhead can be reduced by not sending all of the symbols in SC-FDMA block as training during training mode.

Next, the theoretical findings for the unconstrained AFD-DFE described herein can be validated through simulation. Like LTE systems, the carrier frequency and bandwidth can be set to 2 GHz and 5 MHz, respectively. Other simulation parameters are M=16 and N=512, therefore, a maximum number of users that the system can support may be K=32. A Quadrature Phase shift Keying (QPSK) modulation scheme may be used, and the channel is frequency selective with 12-paths with each path fading independently, according to the Rayleigh distribution. For FIGS. 19 to 22, the user velocity v can be set to 3 km/h.

FIG. 19 is an exemplary graph of learning curves for LE and DFE in unconstrained SC-FDMA systems, according to certain embodiments. In one implementation, interleaved mapping is assumed with no CFO, and the SNR is set to 20 dB. As can be seen in FIG. 19, the AFD-DFE outperforms the AFD-LE in this scenario where more than 18 dB gain in MSE is achieved. More importantly, both equalizers have convergence speeds that are approximately equal. The computational complexity of the AFD-DFE is higher than that of the AFD-LE but provides an increase in performance as compared to the AFD-LE. Also, FIG. 19 shows that the RLS-based AFD-DFE has a better performance than that of the LMS-based AFD-DFE with respect to convergence speed and MSE. Convergence is achieved for the RLS after fifteen iterations are performed; which may counteract the resulting complexity and latency increases due to adaptation. FIG. 19 also depicts the theoretical curves for the three algorithms (AFD-DFE with LMS, AFD-LE with RLS, and AFD-DFE with RLS) which shows that the simulation results correspond to the theoretical results. In addition, the theoretical curve of the DFE assumes perfect decisions.

FIG. 20 is an exemplary graph of performance of the AFD-DFE, according to certain embodiments. For example, the Bit Error Rate (BER) performance of the AFD-DFE with RLS, the MMSE DFE, and the LE are shown. The AFD-DFE with RLS outperforms the LE with known channel with respect to BER. Also, the performance of the MMSE-DFE described by C. Zhang, Z. Wang, Z. Yang, J. Wang, and J. Song, “Frequency Domain Decision Feedback Equalization for Uplink SC-FDMA,” IEEE Transactions on Broadcasting, vol. 56, pp. 253-257, June 2010, the entire contents of which is incorporated by reference in its entirety and N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, the entire contents of which is incorporated by reference in its entirety, with known channel is worst at low SNRs, which may be due to poor estimation of the correlation between the transmitted data and the decisions. Furthermore, the reliability of the proposed AFD-DFE with RLS algorithm can be increased based on a selection of the initial value of the autocorrelation matrix for both the feedforward and feedback sections, which makes the performance of the AFD-DFE with RLS algorithm approach that of the linear equalizer at low SNR's. In addition, an assumption can be made that no constraint is put on the feedback filter unlike N. Benvenuto and S. Tomasin, “Iterative Design and Detection of a DFE in the Frequency Domain,” IEEE Transactions on Communications, vol. 53, pp. 1867-1875, November 2005, B. Ng, C.-t. Lam, and D. Falco, “Turbo frequency-domain equalization for single-carrier broadband wireless systems,” IEEE Transactions on Wireless Communications, vol. 6, pp. 759-767, February 2007 for SISO schemes and G. M. Guvensen and A. O. Yilmaz, “A General Framework for Optimum Iterative Blockwise Equalization of Single Carrier MIMO Systems and Asymptotic Performance Analysis,” IEEE Transactions on Communications, vol. 61, pp. 609-619, February 2013 for MIMO schemes.

FIG. 21 is an exemplary graph of performance of localized and interleaved AFD-DFE, according to certain embodiments. As shown in FIG. 21, the performance of interleaved mapping can exceed the localized mapping because the localized mapping is sub-band dependent. For example, if a user is assigned a sub-band which approaches a deep null of the channel, then the performance may be reduced.

CFO, Ω, can be detrimental to the performance of the AFD-DFE if a slip in frequency occurs and, therefore, can degrade the system's performance. FIG. 21 depicts the performance of the system under two different CFO's, Ω=0 and Ω=0.2 for a single user. Localized mapping can be more sensitive to CFO in the single user case due to interference from adjacent carriers, while there may be no significant performance loss in case of interleaved mapping in worst case (Ω=0.2). Moreover, a clear gain in the performance of AFD-DFE over LE (with perfectly known channel) in case of Ω=0.2, can be seen in the graph of FIG. 21. In the ensuing simulations described herein, the interleaved mapping technique is used due to performance results that do not depend on sub-band assigned to the user.

FIG. 22 is an exemplary graph of AFD-DFE performance under CFO with three users, according to certain embodiments. For the case described by FIG. 22, the CFOs of user 1, user 2 and user 3 are denoted by Ω₁, Ω₂ and Ω₃, respectively. FIG. 22 illustrates the robustness of the AFD-DFE to CFO due to how the performance of the AFD-DFE is maintained even with the worst case CFO of Ω₁=0.1, Ω₂=0.2, Ω₃=0.3 as compared to the best case of Ω₁=Ω₂=Ω₃=0, and the AFD-DFE can achieve predetermined performance levels in the presence of CFO. With CFO, the frequency-domain channel matrix is no longer diagonal. However, the three main diagonals of the frequency-domain channel matrix include a majority of the channel energy which means that most of the interference may occur between adjacent subcarriers. Since the simulations described herein have only three users, the users are not assigned adjacent subcarriers for the case of the distributed subcarrier allocation. More specifically, the distribution of the 512 subcarriers among the three users is done as follows: user 1=[1, 17, 33, 49, . . . , 241], user 2=[4, 20, 36, 52, . . . , 244], user 3=[7, 23, 39, 55, . . . , 247], which can mitigate the effect of CFO. However, as the number of users increases to where the system operates under full load conditions, CFO may be unavoidable because the freedom in assigning non-adjacent subcarriers to the users may be reduced.

FIGS. 23 and 24 are exemplary graphs showing the effect of user velocity on system performance, according to certain embodiments. FIG. 23 illustrates the effect of the user velocity (v) on the system's performance for a single user, and FIG. 24 illustrates the effect of the user velocity (v) on the system's performance for three users. In addition, for purposes of the simulation, perfect transmitter/receiver oscillator's synchronization is assumed when evaluating the performance under Doppler Effect. Three velocities are chosen for comparison, e.g., low (3 km/h), medium (30 km/h) and high (300 km/h). For the case of three users in FIG. 24, unequal velocities are assigned to the users and these are v₁, v₂ and v₃ for user 1, user 2 and user 3, respectively. As can be observed from FIGS. 23 and 24, the RLS AFD-DFE described herein is robust to the Doppler Effect, and the RLS AFD-DFE can perform above a predetermined threshold when subjected to the Doppler Effect. However, as the SNR increases, the system performance deteriorates as the Doppler frequency increases due to interference from adjacent subcarriers.

FIG. 25 is an exemplary graph of the effect of user velocity on SFBC SC-FDMA systems, according to certain embodiments. In some implementations, a 2-slot interleave mapping is used where two consecutive subcarriers are assigned to one user and this is to satisfy Λ_(io)=Λ_(ie) for i=1, 2 with an assumption that the channel does not change over two consecutive subcarriers. SFBC can be used with single user and the effect of the users's velocity on the system's performance is shown in FIG. 26. In MIMO scenarios, independent 6-path Rayleigh fading channels can be used for each transmit/receive antenna pair. FIG. 25 shows improvement in performance can be obtained through the use of SFBC with 2-slot interleaving.

In addition, the performance of a four-transmit/two-receive antenna scenario when three users share the same frequency band and time slot with three other different users, is shown in FIG. 26. All users can be assumed to be moving with equal velocities, v. From the FIG. 26, it can be seen that the AFD-DFE can separate co-channel users without sacrificing performance.

A system which includes the features in the foregoing description provides numerous advantages. The AFD-DFE equalization techniques described in the present disclosure have a better performance over other equalization techniques. Thus, the present disclosure provides an improvement to the technical field of communication. For example, it improves the performance of mobile phone and data terminals that use LTE standard. In addition, the present disclosure has the advantage of computational complexity of the equalizer. Thus, the present disclosure improve the functioning of the equalizer by increasing processing speed, decreasing power consumption and resulting in a chip area reduction.

Obviously, numerous modifications and variations are possible in light of the above teachings. It is therefore to be understood that within the scope of the appended claims, the invention may be practiced otherwise than as specifically described herein.

Thus, the foregoing discussion discloses and describes merely exemplary embodiments of the present invention. As will be understood by those skilled in the art, the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting of the scope of the invention, as well as other claims. The disclosure, including any readily discernible variants of the teachings herein, define, in part, the scope of the foregoing claim terminology such that no inventive subject matter is dedicated to the public. 

The invention claimed is:
 1. A wireless device communications system, comprising: a wireless device comprising a transmitter to transmit an RF signal, and a base station comprising a receiver to receive the RF signal, wherein the wireless device comprises circuitry configured to generate a plurality of signal matrices based on received signals, determine feedorward coefficients for an adaptive frequency-domain decision feedback equalizer (AFD-DFE) based in part on the signal matrices, generate a plurality of decision matrices based on transformed feedforward coefficients, determine feedback coefficients for the AFD-DFE based in part on the plurality of decision matrices, generate a first matrix having the plurality of signal matrices and the decision matrices, generate a second matrix including the feedforward coefficients and a third matrix including the feedback coefficients of the AFD-DFE, determine an equalizer output as a function of the first matrix, the second matrix and the third matrix such that the function includes adding a first product comprising the plurality of signal matrices premultiplied by a variation of the second matrix and a second product comprising the plurality of decision matrices being premultiplied by a variation of the third matrix, update the feedforward and feedback coefficients of the AFD-DFE based on the equalizer output, equalize the received signals using the equalizer output, and decode the equalized signals.
 2. The wireless device communications system of claim 1, wherein the circuitry is further configured to determine the feedforward and feedback coefficients based on a recursive least squares (RLS) calculation.
 3. The wireless device communications system of claim 2, wherein the circuitry is further configured to determine the RLS calculation with at least one constraint.
 4. The wireless device communications system of claim 1, wherein the circuitry is further configured to apply a three-tap AFD-DFE to the received signal.
 5. The wireless device communications system of claim 1, wherein the circuitry is further configured to operate in a training mode wherein the feedforward and feedback coefficients of the AFD-DFE are updated based on a training signal.
 6. The wireless device communications system of claim 5, wherein the circuitry is further configured to update the feedforward and feedback coefficients of the AFD-DFE based on a difference between the training signal and the equalizer output.
 7. The wireless device communications system of claim 6, wherein the circuitry is further configured to determine that convergence has occurred when the difference between the training signal and the equalizer output is less than a predetermined threshold.
 8. The wireless device communications system of claim 1, wherein the circuitry is further configured to operate in a decision-directed mode when convergence of the feedforward and feedback filter coefficients of the AFD-DFE has occurred during a training mode.
 9. The wireless device communications system of claim 8, wherein the circuitry is further configured to update the feedforward and feedback coefficients of the AFD-DFE based on a difference between the equalizer output and a desired response based on previous equalizer iterations.
 10. The wireless device communications system of claim 1, wherein the circuitry is further configured to operate in a reduced training mode wherein a number of symbols in a training signal used to update the feedforward and feedback filter coefficients of the AFD-DFE is reduced.
 11. The wireless device communications system of claim 1, wherein the circuitry is further configured to determine the equalizer output for Single-Input Single-Output (SISO) Single-Carrier Frequency Division Multiple Access (SC-FDMA) signals.
 12. The wireless device communications system of claim 1, wherein the circuitry is further configured to determine the equalizer output for Multiple-Input Multiple-Output (MIMO) SC-FDMA signals.
 13. The wireless device communications system of claim 12, wherein the MIMO SC-FDMA signals include spatially multiplexed (SM) SC-FDMA signals and space-frequency block coding (SFBC) signals including at least one of conventional SFBC, embedded SFBC, and hybrid SFBC signals.
 14. The wireless device communications system of claim 1, wherein the circuitry is further configured to determine the equalizer output within a predetermined performance threshold when carrier frequency offset is present.
 15. The wireless device communications system of claim 1, wherein the circuitry is further configured to determine the equalizer output within a predetermined performance threshold when Doppler is present.
 16. The wireless device communications system of claim 1, wherein the circuitry is further configured to update the feedforward and feedback coefficients of the AFD-DFE by performing one or more scalar multiplications corresponding to at least one of a number of data symbols in the received signal and a number of transmit antennas.
 17. The wireless device communications system of claim 1, wherein the circuitry is further configured to determine the equalizer output when channel state information is absent. 